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ABSTRACT 

We present the first results from a survey of the relative spatial distributions of galaxies, intergalactic 
neutral hydrogen, and intergalactic metals at high redshift. We obtained high-resolution spectra of 8 
bright QSOs at 3.1 < z < 4.1 and spectroscopic redshifts for 431 Lyman-break galaxies (LBGs) at slightly 
lower redshifts. Comparing the locations of galaxies to the absorption lines in the QSO spectra shows 
that the intergalactic medium contains less neutral hydrogen than the global average within r 0.5/i _1 
comoving Mpc of LBGs and more than average at slightly larger distances 1 £ r 5/i _1 comoving Mpc. 
The intergalactic medium within the largest overdensities at z ~ 3, which will presumably evolve into the 
intracluster medium by z ~ 0, is rich in neutral hydrogen and CIV. The lack of HI absorption at small 
distances from LBGs appears unlikely to be produced solely by the Lyman continuum radiation they 
emit; it may show that the galaxies' supernovae-driven winds maintain their measured outflow velocities 
of ~ 600km s _1 for a few hundred million years and drive away nearby intergalactic gas. We present 
correlation functions of galaxies with Lyman-a forest flux decrements, with CIV systems, and with other 
galaxies. We describe the association of galaxies with damped Lyman-a systems and with intergalactic 
Hell opacity. A strong observed correlation of galaxies with intergalactic metals supports the idea that 
Lyman-break galaxies' winds have enriched their surroundings. 

Subject headings: galaxies: formation, galaxies: high-redshift, intergalactic medium, quasars: 
absorption lines 



1. INTRODUCTION 

The large-scale distribution of matter in the universe 
is well understood only at the earliest times, a few hun- 
dred thousand years after the big bang, when primordial 
photons strained against the opacity of matter and drove 
small acoustic waves throughout space. But though this 
era was appealingly simple, elegantly predicted, magnifi- 
cently observed, and so on, it was nevertheless brief. Par- 
ticles slowed as the universe cooled. Electrons became un- 
able to outrun the Coulombic pull of nuclei and vanished 
into atomic orbits. The universe became transparent; the 
photons of the microwave background hurtled through his- 
tory; and the baryons left behind, freed from the regulat- 
ing pressure of light, began to evolve according to their 
own complicated rules. Many acoustic waves had not even 
completed their first oscillation when the universe ceased 
to be described by the simple physics of linear fluid dy- 
namics. Baryons fell towards overdensities in the matter 
distribution. They crashed together and became shock 
heated. They cooled. Nuclear reactions ignited. Black 
holes formed. Intense radiation filled the universe. The 
final result of this chaos is known only because we can 
survey the wreckage that surrounds us. A small fraction 
of baryons were crushed into stars that now huddle to- 
gether in galaxies, burning through their measure of fuel, 
dying one by one. Most of the remainder were blasted to 
T ~ 10 5 -10 7 K, temperatures they had not experienced 
since shortly after the big bang, and were left to drift for a 
sterile eternity in the vast stretches of intergalactic space. 
Few goals in cosmology are more fundamental than un- 



derstanding the physical processes that transformed the 
large-scale distribution of baryons in this way. 

Many basic observations suggest that supernovae played 
a major role. The disruption of star formation by super- 
nova explosions is the favored explanation for why so few 
baryons are found in stars today (e.g., White & Rees 1978, 
Springel & Hernquist 2002). Numerical simulations can- 
not easily reproduce the large disk galaxies that we observe 
around us (e.g., Weil, Eke, & Efstathiou 1998) or the high 
temperature of intergalactic gas at redshift z ~ 3 (e.g., 
Cen & Bryan 2001) unless they include substantial heat 
input from supernovae. The shape of galaxy clusters' X- 
ray luminosity/temperature relationship differs from naive 
self-similar expectations in a way that suggests that su- 
pernovae may have imparted ~ 1 keV of energy to each 
of the young universe's nucleons (Kaiser 1991; Ponman, 
Cannon, & Navarro 1999). It is difficult to explain why 
the soft X-ray background is so faint and so dominated by 
AGN without asserting that supernovae blew apart dense 
clumps of baryons that would otherwise have produced co- 
pious free- free emission (e.g., Pen 1999). The scarcity of 
faint galaxies in the local universe relative to naive expec- 
tations from cold dark matter models is often attributed 
to the destruction of low-mass galaxies by numerous con- 
current supernovae (e.g., Cole et al. 1994). 

These examples are only a few among many. We are 
unable to account for much of what we observe around 
us without invoking the indistinct notion of strong super- 
nova "feedback," and our understanding of the evolving 
universe will remain seriously incomplete until we compre- 
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hend quantitatively how this feedback works. Here is what 
we know: a large fraction of the gas in starburst galaxies 
at low and high redshift appears to be flowing outwards 
rapidly enough to escape the galaxies' gravitational pull 
(e.g., Heckman et al. 2000; Pettini et al. 2001, 2002); 
galaxies that experienced intense bursts of star formation 
in the past now contain little interstellar gas (e.g., Mayall 
1958; Roberts 1972); and metals produced by stars can 
be found far from known galaxies (e.g., de Young 1978; 
Cowie et al. 1995; Mushotzky & Loewenstein 1997; Elli- 
son et al. 2000). Most attempts to understand how super- 
novae affect the universe are founded on the picture that 
these observations inspire: the numerous supernova ex- 
plosions in a young galaxy create an enormous blast wave 
that rips through the galaxy and lays waste to its sur- 
roundings. But working through the details of this pic- 
ture remains challenging even after 30 years of theoretical 
studies (e.g., Mathews & Baker 1971; Larson 1974; Ozer- 
noi & Chernomordik 1978; Ostriker & Cowie 1981; Dekel 
& Silk 1986; Ikeuchi & Ostriker 1986; Voit 1996; Nath & 
Trentham 1997; Mac Low & Ferrara 1999; Aguirre et al. 
2001; Cen & Bryan 2001; Madau, Ferrara, & Rees 2001; 
Scannapieco & Broadhurst 2001; Croft et al. 2002). It 
is still unclear, for example, which sorts of galaxies were 
responsible for seeding the intergalactic medium with met- 
als, or what effect blast waves have on galaxy formation 
and evolution, or even whether realistic blast waves would 
be physically capable of fulfilling the large role that they 
are assigned in the standard lore. 

These blast waves, or "superwinds," are not easy to 
study theoretically. Supernovae themselves are not well 
understood, and treating the propagation of their numer- 
ous overlapping shock waves into a galaxy's inhomoge- 
neous surroundings is difficult enough on its own; but the 
central problem is that we have little idea of the character- 
istic energy scale to associate with the winds that super- 
novae drive. The energy released by a single supernova, 
~ 10 51 erg, is no mystery, but the number of supernovae 
in a young galaxy is not easy to estimate theoretically or 
observationally, and it is unclear in any case how large a 
fraction of the energy released by supernovae is imparted 
to nascent winds. Much of it may be harmlessly radiated 
away by the dense gas it heats. Physical arguments and 
numerical simulations are at present incapable of estimat- 
ing a priori the energy of a galaxy's superwind to within 
even an order of magnitude. Yet the energy of the winds 
is largely what determines how large an impact they have 
on the evolving baryonic universe. Current theoretical ap- 
proaches usually amount to little more than treating the 
winds' energy as a free parameter that can be adjusted 
until supernovae have the desired impact on the rest of 
the universe — but the circular logic is unsettling to skep- 
tics who wonder if supernovae may be merely a convenient 
scapegoat for the failures of popular cosmogonic models. 
The role of supernovae is probably the biggest remain- 
ing gap in our attempt to account for the evolution of the 
baryonic universe since the time of recombination, and this 
is unlikely to change until observations can provide direct 
constraints on the strength of supernovae-driven winds. 

Inspired in part by these considerations, we began in 
the spring of 1999 a systematic survey of starburst galax- 
ies and the intergalactic material near them. Our strategy 
was to apply QSO absorption-line and faint-galaxy tech- 



niques to the same volumes of space. Comparing the lo- 
cations of galaxies and intergalactic absorbing gas would 
let us map the relative spatial distributions of diffuse and 
collapsed baryons throughout large volumes of the uni- 
verse. We felt that these maps would test our understand- 
ing of the evolving baryonic universe in a number of ways. 
Because one of the most robust ways to measure a blast 
wave's energy is to see how far out of its galaxy's poten- 
tial it has managed to climb, searching for disturbances 
to the intergalactic gas near galaxies seemed a particu- 
larly promising way to measure at last the strength of 
supernovae-driven winds. 

We chose to conduct the survey at redshift z <~ 3. Our 
reasons were primarily practical. First, experience had 
shown us that galaxies at z ~ 3 were easy to find in deep 
images and easy to study spectroscopically. Second, inter- 
galactic gas at z <~ 3 produces a wealth of strong absorp- 
tion lines that are easily detected from the ground in opti- 
cal high-resolution spectra of background QSOs. At higher 
rcdshifts the galaxy spectroscopy becomes prohibitively 
difficult (e.g., Steidel et al. 1999) and background QSOs 
become increasingly rare; at lower redshifts the strongest 
intergalactic absorption lines can only be detected through 
difficult space-based spectroscopy. Similar surveys can be 
constructed at very low redshifts, z <~ (e.g., Norman et 
al. 1996), but it is difficult to cover representative comov- 
ing volumes, and in any case simple arguments suggest 
that the strongest supernova-driven winds likely existed 
at high redshift, when galaxies were less massive and star- 
formation rates were far higher (e.g., Adclbcrger & Steidel 
2000, §4). 

This paper, the first in a series, describes the survey 
and its early results. It compares the relative spatial dis- 
tributions of Lyman-break galaxies (LBGs), intergalactic 
metals, and intergalactic neutral hydrogen and emphasizes 
the evidence that star formation in the LBGs directly in- 
fluences the properties of the nearby IGM. Paper II in the 
series (Adelberger et al. 2003) sharpens these conclusions, 
focusing on the unexpectedly strong larger-scale correla- 
tions between galaxies and HI in the IGM and the impli- 
cations for the physics of the IGM. Paper III (Adelberger 
2003) examines in more theoretical detail the possibility 
that superwinds from star-forming galaxies are responsi- 
ble for the observed correlations between the galaxies and 
the IGM. We tacitly assume throughout the series that 
supernovae would be the primary source of any explosive 
energy release in young galaxies, but AGN are an equally 
plausible candidate. The source of the energy makes little 
difference to our conclusions. 

§2 presents the observational strategy and some aspects 
of the data reduction. §3 describes the characteristics of 
intergalactic gas within <~ 5/i _1 comoving Mpc of galaxies. 
This is farther than the galaxies' winds are likely to prop- 
agate, but the discussion sets the stage for §4 where we 
describe the state of the intergalactic medium nearest the 
galaxies. It appears to be rarefied and metal-enriched, ob- 
servations that may suggest the galaxies' winds have prop- 
agated out to comoving radii r ~ 0.5/i _1 Mpc. In §5 we 
show the two-dimensional correlation functions of galaxies 
with galaxies, with intergalactic HI, and with intergalac- 
tic CIV. These data show the connection of galaxies and 
intergalactic gas on spatial scales between those of §3 and 
§4. §5 contains a brief and largely empirical summary. 
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2. DATA 
2.1. Observations 

Our strategy was to use the Lyman-break technique to 
locate star-forming galaxies at z ~ 3.0 ±0.25 in fields sur- 
rounding background QSOs whose spectra were suitable 
for measuring Lyman-a absorbing gas at z ~ 3. Two cri- 
teria were used to select fields. First, we wanted the QSO 
to lie at 3.3 ^ z 3.6 and to be bright enough for high- 
resolution spectroscopy. The lower redshift limit was cho- 
sen to maximize the number of Lyman-break galaxies at 
redshifts where HI absorption was probed by absorption in 
the QSO spectra; the upper limit was chosen to minimize 
the impact of Lyman-continuum and (especially) Lyman- 
absorption at higher redshifts upon the z ~ 3 Lyman- 
a forest in each QSO spectrum. Second, we wanted the 
fields to have as little 100/im cirrus flux from the Galaxy 
as possible, to minimize the attenuation of Lyman-break 
galaxies' light from dust in our own galaxy. 

We were able to obtain data in five fields satisfying these 
criteria (Table 1); one field contains two QSOs. Our sam- 
ple was augmented to six fields by including data from the 
"SSA22" region where previous Lyman-break observations 
(Steidel et al. 1998) had discovered a QSO at z = 3.352. 

High resolution (R ~ 40000; see table 1) spectra of 
Q0256-0000, Q0302-0019, Q0933+2845, Q1422+2309 4 , and 
Q2233+1341 were obtained with the HIRES echelle spec- 
trograph (Vogt 1994) on Keck I between 1996 and 2000. 
Two overlapping echelle grating angles were chosen to pro- 
vide complete wavelength coverage through the z ~ 3 
Lyman-a forest. SSA22D13 was observed with the Echel- 
lette Spectrograph and Imager (ESI; Sheinis et al. 2000) 
on Keck II in echellette mode (R ~ 6000) in June and 
August 2000. An ESI spectrum was also obtained for 
Q1422+2309b, a faint QSO that we discovered ~ 40" from 
Q1422+2309. The HIRES spectra were reduced in the 
usual way with Tom Barlow's Makee package. Their con- 
tinua were fit using an interactive program kindly provided 
by R. Simcoe. The ESI spectra were reduced and contin- 
uum fit with the "Dukee" suite of custom IRAF scripts 
and C programs written by KLA and M. Hunt. 

U n GlZ images were obtained in each field, with COS- 
MIC (Kells et al. 1998) on the Palomar 5m Hale Telescope 
for Q0256-0000, Q0933+2845, Q2233+1341, and two ad- 
jacent pointings in SSA22, and with the Prime Focus Im- 
ager on the William Herschel Telescope for Q0302-0019 
and Q1422+2309. The Palomar images of Q0933+2845 
and WHT images of Q0302-0019 were supplemented with 
data obtained for another project with MOSAIC on the 
Kitt Peak 4m Mayall telescope. The images were reduced 
and photometric catalogs were constructed as described 
in Steidel et al. (1999). The typical reduced image depths 
(la) were ~ 29.1, 29.2, 28.6 AB magnitudes per arcsec 2 
in U n ,G, and ft, and approximately 2.2 objects per square 
arcminute in each field were found to satisfy the Lyman- 
break selection criteria 

U n -G > G-ft+1.0, G-ft<1.2, ft < 25.5. (1) 

In Q1422 exceptionally deep data let us extend our sample 
to ft = 26. 

4 The beautiful spectrum of this QSO was taken and generously 
shared by W. Sargent 



Low resolution (R ~ 600) spectra for a subset of these 
Lyman-break candidates were obtained with LRIS (Oke 
et al. 1995) on the Keck I and II telescopes between 1995 
and 2001. Each field was observed through at least 4 multi- 
slit masks that accommodated ~ 20 objects each. Spectra 
were obtained for Lyman-break candidates throughout the 
regions observed in U n GlZ, except in the cases of Q0302 
and Q0933 where spectra were obtained only in a ~ 7' — 9' 
region surrounding the QSO rather than throughout the 
larger imaged fields. The field sizes and number of use- 
ful redshifts obtained in each field are listed in Table 1. 
Figure 1 shows the redshift histogram for our galaxy sam- 
ple together with the range of redshifts where our QSO 
spectra could be used to detect intergalactic CIV and HI. 
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Fig. 1. — The range of redshifts probed in our survey. The 
solid histogram in the background shows the distribution of galaxy 
redshifts in the primary sample; refer to left axis. The cross-hatched 
histograms in the foreground show the number of primary-sample 
QSOs that could be used to detect CIV and Lyman-a at different 
redshifts; refer to right axis. The Lyman-a histogram conservatively 
assumes that we could not detect Lyman-a in parts of the spectra 
that were contaminated by Lyman-/3 from gas at higher redshifts. 
The CIV histogram assumes that we could not detect CIV in parts of 
the spectra that were contaminated by Lyman-a from gas at higher 
redshifts. Neither assumption is strictly true. 

These six fields make up the primary sample used in 
most of our analysis below. In a few cases we also used 
data in fields surrounding two additional QSOs, Q0000- 
2620 and Q0201+1120. U n GK images for Q0000-2620 
were obtained with EMMI on the 3.6m NTT telescope in 
1994; images for Q0201+1120 were obtained with COS- 
MIC on the Palomar 5m Hale telescope in 1995. Our 
echelle spectrum of Q0000-2620 is a combination of a 
HIRES spectrum taken by W. Sargent and the UVES 
(Dckker et al. 2000) spectrum made public after the in- 
strument's commissioning. P. Molaro kindly provided us 
with the reduced UVES spectrum. A HIRES spectrum 
of Q0201+1120 was obtained in 1999 and reduced as de- 
scribed above; this spectrum is the same as the one ana- 
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Table 1 
QSO/LBG Fields 
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Primary: 




















Q0256-0000 


02 h 59 m 05.6 


+00°11'22" 


3.364 


18.2 


44000 


41800 


8.5' x 8.5' 


45 


38 


Q0302-0019 


03 h 04 m 49.9 


-00°08'13" 


3.281 


17.8 


44000 


21000 


6.5' x 6.9' h 


47 


23 


Q0933+2845 


09 h 33 m 37.3 


+28°45'32" 


3.428 


17.5 


33000 


28800 


8.9'x 9.3' h 


65 


32 


Q1422+2309 1 


14 h 24 m 38.1 


+22°56'01" 


3.620 


16.5 


44000 


52800 


7.3' x 15.6' 


111 


62 


Q1422+2309b 


14 h 24 m 40.6 


+22°55'43" 


3.629 


23.4 


6000 


43600 








SSA22D13 


22 h 17 m 22.3 


+00°16'41" 


3.352 


21.6 


6000 


24600 


8.7' x 17.4' 


79 


54 


Q2233+1341 


22 h 36 m 27.2 


+13°57'13" 


3.210 


20.0 


44000 


68400 


9.2' x 9.3' 


47 


29 


Supplemental: 




















Q0000-2620 


00 h 03 m 22.9 


-26°03'17" 


4.098 


19.4 


48000 


12500 


3.7' x 5.1' 


18 


1 


Q0201+1120 


02 h 03 m 46.7 


+11°34'45" 


3.610 


20.1 


44000 


40500 


8.7' x 8.7' 


19 


11 



a QSO coordinates 

b QSO AB magnitude 

c Spectral resolution 

d Spectral integration time (sec) 

°Angular size of surrounding region with known z ~ 3 galaxies 
f Number of Lyman-break galaxies with spectroscopic redshifts 

g Numbcr of Lyman-break galaxies with ZLy0 < z < zqso — 0.05 and 2.6 < z < 3.4 

h Rcgion with spectroscopic follow-up; images cover a larger area 

"Sum of lensed components A and C (see, e.g., Rauch, Sargent, Sz Barlow 1999) 



lyzed by Ellison ct al. (2001). LRIS on Keck I was used 
in 1995 and 1996 to measure redshifts for a few objects 
in each field whose U n GlZ colors satisfied equation 1. We 
excluded these two fields from most of the analysis for a 
number of reasons. Their images are shallow. Few spec- 
troscopic redshifts were measured. The U n GlZ filters used 
for observing Q0000-2620 differed somewhat from those 
used in the rest of the observations, leading to significant 
changes in the redshift distribution of spectroscopically ob- 
served galaxies. Moreover Q0000-2620 is at so high a red- 
shift that most of the interesting CIV lines are buried in 
the Lyman-a forest and most of the Lyman-a forest close 
to our galaxies is badly affected by absorption from Lyman 
series lines from gas at higher redshifts. The spectrum of 
Q0201+1120 contains large gaps between echelle orders in 
the red that prevented us from detecting CIV systems in 
a uniform way. Nevertheless in two cases below — figures 9 
and 10 — we were able to make some use of these data. 

2.2. Redshifts 
2.2.1. Initial wavelength calibration 

Of the many steps required to reduce our data, one of 
the most crucial is the wavelength calibration, the esti- 
mate of which wavelengths of light were cast on each pixel 
by the optics of the spectrograph. This section describes 
how we calibrated our LRIS data. The HIRES and ESI 
calibrations were similar. 

The data for a single multislit mask typically consisted 
of three 30 minute exposures followed immediately by a 
brief spectrum of an arc lamp at the same telescope po- 
sition as the final exposure. To correct for gravitational 
flexure of the instrument during the series of exposures, 



spectra from the earlier exposures were shifted in the wave- 
length direction as required to make their sky lines overlap 
with the sky lines in the last exposure. The required shifts 
often approached but rarely exceeded 3A(^ 1 pixel). The 
data were then summed and a wavelength was assigned 
to each pixel based on a 5th order polynomial fit to the 
wavelength vs. pixel number relationship in the arc-lamp 
exposure. Occasionally the wavelengths assigned to sky 
lines by this procedure were systematically incorrect by 
;$ 2A; in these cases each spectrum was shifted by a fixed 
wavelength increment. In the resulting spectra, the mean 
difference between the true and estimated sky-line wave- 
lengths was negligible and the lcr scatter was ~ 0.25A. The 
wavelengths assigned to each pixel were finally adjusted by 
small amounts using standard formulae to correct for the 
motion of the earth around the sun and for the decrease 
in wavelength due to the index of refraction of air. After 
applying this procedure (or minor variants) to our data 
from LRIS, HIRES, and ESI, we were left with spectra in 
a common vacuum-heliocentric frame that allowed them 
to be intercompared. 

Although this approach is standard, readers should be 
aware that it is not perfect. For example, assigning cor- 
rect wavelengths to sky lines in our low-resolution spectra 
does not guarantee that the wavelength assignment will be 
correct for the galaxies: if differential atmospheric refrac- 
tion or errors in slitmask alignment, guiding, or astrometry 
caused the centroid of a galaxy's light to be displaced by 
O'.'l from the center of its 1'.'4 slit, the wavelength solution 
for the galaxy would differ by ~ lA from the wavelength 
solution for the sky lines. We made no attempt to correct 
for this difficult problem. It may contribute significantly 
to our derived uncertainties in the galaxy redshifts, which 
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we will shortly discuss. 

2.2.2. Correction for galactic winds 

The ~ 30 km s _1 velocity of the earth about the sun 
perturbs our redshifts by Az ~ 0.0001. The gravitational 
flexure of the spectrograph and the slowing of light by 
the earth's atmosphere perturb them by larger amounts 
Az ~ 0.001. Larger still are the perturbations due to 
the chaotic motions of material within Lyman-break galax- 
ies themselves. The various emission and absorption lines 
within the spectrum of a single Lyman-break galaxy sel- 
dom have the same redshift. Absorption lines from the 
cool interstellar gas tend to have the lowest redshifts; neb- 
ular emission lines from the hot gas close to stars tend 
to have slightly higher redshifts; and Lyman-a almost al- 
ways has the highest redshift (e.g. Pcttini ct al. 2002; 
Pettini et al. 2001). The redshift range spanned by the 
interstellar lines and Lyman-a often exceeds Az = 0.01 
(~ 750kms _ ; see Fig. 2). This reflects velocity differ- 
ences within the galaxy itself — the observed redshift dif- 
ferences are qualitatively consistent with the idea that a 
typical Lyman-break galaxy is expelling some fraction of 
its interstellar gas in a supernovae-driven wind (figure 3; 
see also Tenorio-Tagle et al. 1999) — but if the redshift 
differences were erroneously assumed to be caused by the 
Hubble flow, the implied comoving distance between the 
reddest and bluest features would be ~ 7/i _1 Mpc (z = 3, 
VLm = 0.3, Oa = 0.7). The challenge is to estimate from 
our data the redshift of each Lyman-break galaxy's stars, 
to pick out where each Lyman-break galaxy lies within the 
~ 7h Mpc range that our observations would seem to 
allow. 
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Fig. 2. — The distribution of velocity differences between Lyman- 
a emission and interstellar absorption in the spectra of Lyman-break 
galaxies. 



Our approach was guided by the assumption that the 
redshift of a galaxy's nebular lines (e.g., [OII]A3727, H/3, 
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Fig. 3. — Schematic view of the different redshifts in the spectrum 
of a single Lyman-break galaxy. Narrow (<jid ~ 70 km s _1 ) nebular 
emission lines have been detected through the near-IR spectroscopy 
of 27 Lyman-break galaxies. These emission lines presumably lie 
near the galaxy's systemic redshift. Wide interstellar absorption 
lines are usually seen blueward of the nebular lines. Lyman-a emis- 
sion of varying widths lies redward. The relative redshifts can be un- 
derstood through a simple wind model for the gas in Lyman-break 
galaxies. Absorption of stellar continua is produced by outflow- 
ing (i.e., blueshifted) gas lying between the observer and the stars. 
Resonant scattering sends Lyman-a photons on long random walks 
through the dusty outflow. The few photons that escape without 
being absorbed by dust are those that were scattered along unusu- 
ally short paths out of the galaxy; most of these photons scattered 
off the redshifted back of the outflow and along the steep velocity 
gradient towards us. 



[OIII]AA4959,5007) ought to be nearly equal to the red- 
shift of its stars, that the gas responsible for nebular emis- 
sion should always lie close to hot stars. Because nebu- 
lar lines are redshifted to the near-IR, where spectroscopy 
is difficult, we were unable to measure their redshifts for 
the vast majority of galaxies in our sample. Instead we 
searched for correlations between nebular-line redshifts 
and UV spectral characteristics among the 27 Lyman- 
break galaxies that have measured nebular redshifts, then 
used these correlations to estimate the systemic redshift 
of each galaxy in our larger sample from its rest-frame 
UV spectrum. The near-IR data for 14 of these 27 galax- 
ies were taken from Pettini et al. (2001); our NIRSPEC 
(McLean et al 1998) spectra of the remaining galaxies will 
be presented elsewhere. 

The following four relationships were found to hold for 
the 27 galaxies with nebular redshifts. Each resulted from 
a singular-value decomposition solution of a linear least- 
squares equation (e.g. Press et al. 1994, §15.4). 

Among the galaxies with detectable Lyman-a emis- 
sion, the velocity of Lyman-a relative to the nebular lines 
roughly satisfied 
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■ 9W\ kms" 



(2) 



where W\ is the rest-frame equivalent width of Lyman- 
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a in A . The rms scatter about this mean relationship 
was 170km s _1 . The sense of this correlation is consistent 
with the idea that dust absorption of resonantly scattered 
photons is responsible for driving Lyman-a to the red; the 
reddest Lyman-a lines ought to be the weakest. 

Averaging equation 2 over the distribution of W\ among 
galaxies with detectable Lyman-a emission but no de- 
tectable absorption lines leads to the mean relationship 

w LyQ ~310 kms -1 . (3) 

This equation can provide an estimate of a Lyman-a- 
emitting galaxy's systemic redshift when W\ has not been 
measured, but the redshift's precision (cr v <~ 250 km s _1 ) 
suffers. Note that equation 3 is applicable only to data of 
quality similar to ours, since the adopted distribution of 
W\ was appropriate for galaxies with detectable Lyman- 
a emission and no detectable interstellar metal lines, and 
this classification is affected by the signal-to-noise ratio of 
our typical spectra. 

Among galaxies that had both measurable Lyman-a and 
interstellar-absorption redshifts, the mean of the Lyman- 
a and interstellar redshifts was larger than the nebular 
redshift by an amount 

i; mid ~ -0.014Au - 7.0W X + 280 kms" 1 (4) 

if W\ was used in the fit or 

v mid ~ -O.lUAv + 230 kms -1 (5) 

if W\ was ignored. Here Av is the observed velocity differ- 
ence between Lyman-a and the absorption lines. The rms 
scatter about equations 4 and 5 was 150 and 210km s -1 
respectively. The physical origins of correlations 4 and 2 
are similar. 

The mean velocity difference between the interstellar 
lines and nebular lines for galaxies with measurable in- 
terstellar redshifts was 

u abs ^ -150 kms -1 (6) 

with an rms scatter of 160km s _1 . The qualitative picture 
of figure 3 suggests that taking account of the velocity 
widths of the interstellar absorption lines might improve 
the accuracy of equation 6, but in practice the resolution 
of galaxy spectra, set by the seeing to ~ 300-600 km s _1 
(FWHM), was unusably coarse. 

Two anomalous galaxies were excluded when calculating 
each of these fits and their scatter. The large residuals of 
the excluded galaxies could be traced back to their unusual 
spectra. One had two Lyman-a emission lines separated 
by <~ 1000 km s _1 ; the other had interstellar absorption 
at two redshifts separated by more than 1500km s~ . As 
far as we could tell nothing about the UV spectra of these 

5 Here W\ refers to the equivalent width of only the part of the 
Lyman-a line that is observed in emission. It is therefore non- 
negative by definition. Because the lines often have P-Cygni profiles, 
the values W\ we measure from our spectra according to this defini- 
tion will generally differ from the equivalent widths one would derive 
from (e.g.) a narrow-band image; we would record a positive (emis- 
sion) value for W\ if we saw a weak Lyman-a emission line at the 
red edge of a deep Lyman-a absorption trough, while narrow-band 
photometry would reveal only that Lyman-a appeared dominantly 
in absorption. 



objects would have allowed us to predict their nebular red- 
shifts with much precision. We chose to give up on the 2 
pathological cases and optimize our fits for the 25 normal 
galaxies where we had some chance of success. Readers 
should be aware that for perhaps 1 galaxy among 10 the 
equations above will lead to an estimated redshift that is 
incorrect by several times the quoted rms. 

Equations 2 or 3, 4 or 5, and 6 were used to esti- 
mate the systemic redshifts of galaxies in our sample that 
had, respectively, detectable Ly-a emission only, both de- 
tectable Ly-a emission and interstellar absorption, and 
only detectable interstellar absorption. Lyman-a equiv- 
alent widths were measured only for galaxies within 60" 
of a QSO sightlinc, because only for these galaxies did 
the moderately improved precision of equations 2 and 4 
relative to 3 and 5 make a significant difference. 

We can check the accuracy of our redshift estimates by 
noting that that the cross-correlation function of galaxies 
and intergalactic material will be isotropic in an isotropic 
universe. On average the intergalactic medium ought to be 
symmetric when reflected in the z direction about the true 
location of each Lyman-break galaxy. The same is not true 
for reflection about the measured interstellar or Lyman-a 
redshift of the galaxies, because these redshifts are influ- 
enced only by material that lies between the galaxies' stars 
and us, and so the symmetry is broken. 

Figure 4 shows the mean Ly-a transmissivity of the in- 
tergalactic medium for all pixels of our QSO spectra that 
lay within an apparent distance of 4/i _1 and 6/i _1 comov- 
ing Mpc of a Lyman-break galaxy. Also shown is the mean 
transmissivity we would have measured had our estimated 
galaxy redshifts been systematically higher or lower by an 
amount ranging from Az — —0.02 to Az = +0.02. The 
three panels correspond (from top to bottom) to galaxies 
with only detectable absorption, both detectable absorp- 
tion and emission, and only detectable emission. In each 
case the curves of mean transmissivity vs. Az are roughly 
symmetrical about Az = 0, which suggests that the sys- 
temic redshifts assigned through equations 2,5, and 6 are 
accurate on average. The mean redshift offset of the emis- 
sion and/or absorption lines for each class is marked by the 
vertical bars. (Readers may find other pertinent data in 
Figure 25, which is discussed in a different context below). 

Further evidence of the procedure's accuracy comes from 
recent work by Shapley et al (2003, in preparation), who 
used the above equations to shift the spectra of ~ 800 
LBGs into a common rest-frame. Adding these shifted 
spectra together revealed weak stellar photosphcric lines 
with velocity v — ± 30kms _1 , showing that on average 
the equations successfully estimate the redshift of the stars 
in Lyman-break galaxies. 

3. GALAXIES AND INTERGALACTIC GAS AT LARGE 
SEPARATIONS 

We are now ready to discuss the relative spatial distri- 
butions of galaxies and intergalactic material at z ~ 3. 
This section will concentrate on the properties of the in- 
tergalactic gas that lies within a few comoving Mpc of the 
galaxies in our sample. Simple arguments based on en- 
ergetics suggest that a wind driven by the supernovae in 
a Lyman-break galaxy will be unable to propagate more 
than <~ lh^ 1 comoving Mpc from its source (e.g., Adel- 
berger 2003), and so data averaged over several Mpc will 
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Fig. 4. — The mean transmissivity of all parts of the QSO spectra 
that lie within 4/i -1 (solid line) or 6/i -1 (shaded region) comoving 
Mpc of a Lyman-break galaxy. The height of these curves at Az = 
shows our best estimate from the redshift assignment described in 
§ 2.2.2; the height of the curves at other values of Az shows the mean 
transmissivity we would have measured had we applied an additional 
redshift adjustment of Az to each galaxy. Large systematic errors in 
our redshift estimates would show up as asymmetries about Az = 
in this plot, but in fact Az = lies near the minimum of each curve. 
The sample was divided into classes denned by the type of features 
we could detect in each object's spectrum. The mean redshift of the 
detected features relative to our assigned redshift is marked with 
vertical bars, short for absorption lines and tall for emission lines. 
The size of our estimated uncertainty (Her) in a single galaxy's 
redshift is indicated by the rectangular strip surrounding Az = 0. 



not tell us much directly relevant to the observed galaxies' 
impact on their surroundings. But galaxies ought to form 
preferentially where the large-scale density of dark matter 
is high (e.g., Kaiser 1984), and we can expect that the 
bright galaxies we observe will lie near numerous fainter 
galaxies and near any sites of previous star formation. The 
cumulative effect of winds from the seen and unseen galax- 
ies is something the results of this section could in principle 
reveal. 

3.1. HI 

One of the most striking aspects of our data is the fact 
that the transmissivity of the Lyman-a forest tends to be 
low in volumes that contain a large number of Lyman- 
break galaxies. Figure 5 shows the Lyman-a forest along 
skewers through the ~ 10 x 10 x 10h~ 3 comoving Mpc 3 
cubes that contain the two most significant galaxy over- 
densities and the two most significant galaxy underdensi- 
ties in our sample. The mean transmissivity is much lower 
through the galaxy overdensities. The trend is observed 
throughout the range of galaxy densities, not merely at 
the extremes. This can be seen in figure 6, which shows 
the mean galaxy overdensity in 3-dimcnsional cells sur- 
rounding the QSO sightlines in our primary sample as a 
function of the mean flux /0.02 on the Az = 0.02 sightlinc 



segment enclosed by the cell. Each cell was a right rectan- 
gular parallelepiped with depth Az = 0.02 and transverse 
dimensions equal to its CCD image's (table 1); in comov- 
ing units each was roughly a cube with side-length 13/i -1 
comoving Mpc (CIm = 0.3, J7a = 0.7). The mean galaxy 
overdensity associated with Lyman-a forest spectral_ seg- 
ments with mean transmissivity in the range /o.u2±£/ was 
calculated by summing the observed number of galaxies in 
every cell with a mean transmissivity in that range, then 
dividing by the sum of the number of galaxies that would 
have been expected in those cells if Lyman-break galaxies 
were distributed uniformly in space: 



(4 



gal/ 



E cells 
i N,- 



1- 



(7) 



where iVj is the observed number of galaxies in the ith cell 
and fj, 3l i is the expected number in the absence of clus- 
tering. fi g i was estimated by scaling our average selection 
function by the number of galaxies with spectroscopic red- 
shifts in the appropriate field. See appendix B for a justifi- 
cation for this estimator. In order to (slightly) reduce the 
shot noise, the centers of adjacent cells were separated by 
Az = 0.004, so that we oversampled our data by a factor 
of 5. The QSO spectra were used over the redshift range 
(1 + z q )X Ly p/ X Lya - 1 < A/ A Lyct - 1 < z q - 0.05, where z q 
is the QSO redshift, in order to avoid contamination of the 
Lyman-a forest by material associated with the QSO or 
by Lyman-/? absorption from gas at higher redshifts. Seg- 
ments of the QSO spectra with 2.91 < A/A LyQ - 1 < 2.98 
in SSA22 and with 3.214 < A/A Ly „ - 1 < 3.264 in Q0933 
were removed from the analysis to avoid damped Lyman-a 
systems. 

Some may be surprised that the correlation between HI 
and galaxy density does not break down at large values of 
(^gai)- The largest galaxy overdensities in our sample are 
likely to evolve into rich clusters by z = 0; this conclusion 
follows from the comoving volume between z — 2.7 and 
z = 3.3 that we have observed, ~ 3.7 x 10 5 ft.~ 3 Mpc 3 for 
fiM = 0.3, f^A = 0.7, which is large enough to contain 
~ 3 structures that are destined to evolve into clusters 
with X-ray temperatures kT > 2.5keV at z ~ 0. We have 
no better candidates for the protoclusters than the two 
overdensities shown in figure 5 or (more generally) than 
the large galaxy overdensities associated with the lowest 
/0.02 bins in figure 6. A number of arguments suggest 
that the intracluster medium was heated at early times, 
before the cluster itself had formed, and this has led some 
to speculate that young clusters at z ~ 3 might contain 
far less neutral hydrogen than the universal average (e.g. 
Theuns, Mo, & Schaye 2001). Figures 5 and 6 show that 
the opposite is true. The large Lyman-a opacities of (pre- 
sumed) intracluster media at early times do not imply that 
preheating has not happened, however, as we will argue in 
Adelberger (2003): putative shocks traveling outward from 
high-redshift galaxies and heating the young intracluster 
medium could easily increase the mean HI content of the 
volumes they affect. See Adelberger et al. (2003) for an 
extended discussion of Figure 6 and its implications. 

3.2. CIV 

The notion that the intergalactic material within galaxy 
overdensities may have already been preheated receives 
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Fig. 5. — Intergalactic absorption in the ~ 10 X 10 X 10/i -3 
comoving Mpc 3 cells that contain the two most significant galaxy 
overdensities (top panel) and underdensities (bottom panel). The 
shaded curves show the Lyman-a forest absorption along segments 
of our QSO spectra that pass through the overdensities. Each seg- 
ment is ~ 17h~ comoving Mpc long for Qm = 0.3, Q\ = 0.7. 
Galaxy redshifts are indicated with vertical lines. The height of 
each line is proportional to the galaxy's distance to the QSO sight- 
line; a line reaching flux/cont = 1 represents a galaxy 10' away 
(~ 12.9k' 1 comoving Mpc for Q M = 0.3, tt A = 0.7). The quoted 
overdensities 5 ge j are measured in a 10h~ 1 Mpc cube at the cen- 
ter of the panel. The galaxy overdensity in the upper left panel is 
the "spike" in SSA22 (e.g., Steidel et al. 1998), the largest galaxy 
overdensity in the entire Lyman-break galaxy sample and our best 
candidate for a proto-Abell cluster. These data suggest that the 
intercluster medium contained a relatively large amount of neutral 
hydrogen at early times. 



some support from its observed metal content. The asso- 
ciation of Lyman-break galaxies and metals can be quanti- 
fied in any number of ways, and will receive more attention 
below; but the qualitative point we wish to make here is 
adequately illustrated by figure 7. This figure shows the 
redshifts of the Lyman-break galaxies and CIV systems in 
the field of Q1422+2309, the QSO with the best spectrum 
in our sample. The top panel shows the number of galaxies 
and CIV systems in redshifts bins of width Az = 0.025. 
The bottom panel shows the result of smoothing the raw 
redshifts by a Gaussian with width a z = 0.008 (~ 5.4/i _1 
comoving Mpc for £Im = 0.3, fl\ = 0.7) and then dividing 
by the selection function. The selection function used for 
the galaxies, shown in the top panel of the figure, is a spline 
fit to a coarsely binned histogram of every Lyman-break 
galaxy redshift in our sample. The selection function for 
CIV systems was approximated as constant with redshift. 
Though obscured somewhat by shot noise, the connection 
between galaxy density and CIV density is in fact sur- 
prisingly strong; Pearson's correlation coefficient between 
the two curves in the bottom panel of figure 7 is 0.61. It 
is interesting that the amplitudes of the CIV and galaxy 
fluctuations are comparable. The figure suggests that a 
volume of the universe which is overdense in galaxies by a 
factor of 1 + 5 ~ 3 (e.g.) will tend to be overdense in CIV 
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Fig. 6. — The mean galaxy density in a ~ 13/i _1 comoving Mpc 
cubical cell as a function of the mean Lyman-o forest transmissivity 
along the Az = 0.02 sightline skewer through the cell. Galaxy over- 
densities contain large amounts of neutral hydrogen; galaxy voids 
contain less. Arrows indicate the mean value of /0.02 among all 
sightline segments in our sample. 



systems by a factor of ~ 3 as well. But because Lyman- 
break galaxies are biased tracers of the matter distribution 
(e.g., Adelberger et al. 1998), the baryonic overdensity of 
the same volume will be significantly smaller. This shows 
that there is more detectable CIV absorption per baryon 
in galaxy overdensities than elsewhere. One of several pos- 
sible interpretations is that the intergalactic metallicity is 
enhanced near galaxies at z ~ 3. A similar trend might 
also result from the density-dependence of carbon's ion- 
ization state. 

3.3. Hell 

Heap et al. (2000) obtained a spectrum at wavelengths 
1140 £ A obs £ 1300A of one of the QSOs in our sam- 
ple, Q0302-0019. This spectrum revealed that the Hell 
Lyman-a (A rcs t — 304A) optical depth of the intergalac- 
tic medium in front of the QSO was large and surprisingly 
variable. The observed range of optical depths in the spec- 
trum, THoii ~ 1 to THoii ~ 5, apparently implies that the 
hardness of the ionizing background and the ratio of HI 
to Hell number density must vary significantly in the in- 
tergalactic medium at z ~ 3. Figure 8 shows that the 
observed variations in intergalactic Hell opacity appear to 
be spatially correlated with the locations of galaxies and 
CIV systems. Excluding the region z > 3.24, which is pre- 
sumably affected by radiation from Q0302-0019 itself (e.g. 
Hogan et al. 1997), and the region 2.987 < z < 3.016, 
which is contaminated by geocoronal Lyman-a emission, 
we calculate r s ~ 0.21, 0.27 for the value of Spearman's 
rank correlation coefficient between the smoothly varying 
galaxy, CIV overdensities shown in figure 8 and the Hell 
absorption spectrum. The galaxy and CIV overdensities 
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Fig. 7. — The redshift distributions of galaxies and CIV absorbers 
in the field of Q1422+2309. The top panel shows the number of ob- 
jects observed at each redshift. The bottom panel shows the implied 
overdensity as a function of redshift after smoothing our raw red- 
shifts by a Gaussian of width a z = 0.008. The good correspondence 
of features in the lower panel shows that CIV systems are preferen- 
tially found within galaxy overdensities. 



were calculated as described above, near figure 7. Corre- 
lation coefficients of this size, r s ~ 0.2-0.3, do not imply 
perfect correspondence, and readers will easily find exam- 
ples in Figure 8 where gaps in Hell opacity have few asso- 
ciated galaxies or metals in our sample (e.g., at z = 2.83 
and z = 3.05). Nevertheless the statistical correlations 
visible in the figure are moderately significant. A corre- 
lation strength r s > 0.21 was found for roughly 13% of 
randomized galaxy catalogs that we correlated with the 
Hell spectrum; r s > 0.27 was found for roughly 6% of the 
randomized CIV catalogs. It is easy to think of reasons 
that the Hell opacity of the intergalactic medium might 
decrease near galaxies or CIV systems. The most obvi- 
ous is that the reionization of Hell should happen first 
in the dense regions where galaxies, metal-line absorbers, 
and AGN reside. But this is unlikely to be the full ex- 
planation; the Hell optical depth at the mean density at 
z ~ 3 would be of order 1000 if Hell were the dominant 
ionization state, and so the small but significant fraction 
of the QSO's light that is detectable at A 1310A shows 
that Hell must already be highly reionized almost every- 
where. A more likely explanation may be that the spatial 
clustering bias of galaxies and AGN causes the number of 
Hell-ionizing photons per baryon to be largest in overden- 
sities where galaxies and AGN tend to be found. Because 
the mean free path of 4Ryd photons at z ~ 3 is likely 
to be ~ 1500 km s _1 (Miralda-Escude, Hachnelt, & Rees 
2000), or roughly the size of independent bins in figure 8, 
we might reasonably expect to see this effect in the figure. 
We should also add that much of the evidence for a cor- 
relation between Hell transmissivity and galaxy or CIV 
density comes from redshifts z ^ 2.9. If the decrease in 



Hell opacity at these redshifts reflects a global change in 
the hardness of the ionizing background due to the grow- 
ing dominance of AGN (e.g. Songaila 1998; cf. Boksen- 
berg, Sargent, & Rauch 1998), then the correspondence 
of high Hell transmissivity with the observed galaxy and 
CIV overdensities would be only a coincidence, and much 
of our evidence for a connection between galaxies and Hell 
would be removed. 
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Fig. 8. — The correlation between galaxies, CIV systems, and 
intergalactic Hell transmissivity in the field of Q0302-0019. The 
curves in the foreground show the overdensity of galaxies and CIV 
systems as a function of redshift; they were produced by smoothing 
the list of redshifts with a Gaussian of width a z = 0.008. The shaded 
curve in the background shows the Hell Lyman-a absorption seen 
in the spectrum of Q0302-0019 (Heap et al. 2000) in units of 10~ 16 
erg s — 1 cm -2 A -1 (refer to right axis). The dotted line is the error 
spectrum. The Hell content of the IGM is detectably reduced from 
its mean value in regions where the spectrum significantly exceeds 
its uncertainty. These regions often (but not always) coincide with 
galaxy and CIV overdensities. 



3.4. Damped Lyman-a systems 

For completeness we will briefly describe the associa- 
tion of Lyman-break galaxies with damped Lyman-a sys- 
tems (table 2). This topic has been recently discussed 
by Gawiser et al. (2001). Figure 9 compares the mean 
overdensity of Lyman-break galaxies within cells centered 
on damped systems to the mean overdensity of Lyman- 
break galaxies within cells centered on other Lyman-break 
galaxies. Each cell was a cylinder with height Az = 0.025 
(~ \Q.7h~ 1 comoving Mpc for fijif = 0.3, J7a = 0.7) and 
radius equal to value of AO shown on the figure's abscissa. 
Even at the largest value of A9 (265", or r - 5.7h^ 1 Mpc 
comoving), each cylinder's diameter is significantly smaller 
than its height. This helped ensure that galaxies corre- 
lated with the central object would fall into the surround- 
ing cell even in the presence of substantial redshift errors. 
If Lyman-break galaxies and damped systems were similar 
objects, the mean density of Lyman-break galaxies around 
damped systems would be similar to the mean density of 
Lyman-break galaxies around other Lyman-break galaxies, 
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but this is not the case. The observed number of Lyman- 
break galaxies close to the damped systems in our sample 
is instead roughly what one would expect if the two popu- 
lations were independently distributed; we see no evidence 
for an excess of Lyman-break galaxies near damped sys- 
tems. In contrast the overdensity of Lyman-break galax- 
ies near other Lyman-break galaxies is large, as the filled 
circles in figure 9 show. Table 2 compares the observed 
number of Lyman-break galaxies near each damped sys- 
tem with the number one would have expected if damped 
systems and Lyman-break galaxies were the same objects. 
In this case, we would have expected to find 5.96 Lyman- 
break galaxies within A0 = 265", Az = 0.0125 of the 
damped systems. Instead we found 2. A Poisson distri- 
bution with true mean 5.96 will yield 2 or fewer counts 
about 6.4% of the time, so the significance of this result 
is slightly better than 90%. The significance can be as- 
sessed in a more empirical way by exploiting the fact that 
our spectroscopic sampling density in Q0933+2841 and 
SSA22D13 is very similar to the density in the rest of the 
Lyman-break galaxy sample. We selected at random from 
our Lyman-break galaxy catalogs many sets of two galax- 
ies, with one galaxy at roughly the redshift of the damped 
system in Q0933-I-2841, the other at roughly the redshift 
of the damped system in SSA22D13. We then counted the 
number of other Lyman-break galaxies in cylindrical cells 
surrounding the two galaxies in each set, and compared to 
the number of Lyman-break galaxies in cells surrounding 
the damped systems. The curve in figure 9 shows the fre- 
quency P(> n) with which a set of two random galaxies 
had more galaxy neighbors than the set of two damped sys- 
tems. The lack of Lyman-break galaxies within ~ 5.7/i _1 
comoving Mpc (AO < 265") of these two damped sys- 
tems lets us conclude with ~ 90% confidence again that 
bright Lyman-break galaxies and damped systems do not 
reside in similar parts of the universe. Taken together, the 
data in figure 9 suggest that the statistical association be- 
tween damped systems and Lyman-break galaxies is weak; 
the data are consistent with the idea that damped sys- 
tems tend to reside in small potential wells that are much 
more uniformly distributed in space than the massive wells 
that presumably host Lyman-break galaxies. Others have 
reached a similar conclusion from different starting points 
(e.g., Fynbo, Moller, & Warren 1999; Mo, Mao, & White 
1999; Haehnelt, Steinmetz, & Rauch 2000). 

4. GALAXIES AND INTERGALACTIC GAS AT SMALL 
SEPARATIONS 

Any influence of a galaxy's supernovae on its surround- 
ings should be most pronounced near to the galaxy itself. 
This makes it especially interesting to study the contents 
of the intergalactic medium near Lyman-break galaxies. 
Figure 10 shows the distribution of HI and CIV absorption 
along the QSO sightline segments that approach Lyman- 
break galaxies most closely. The angular separation of 
the galaxy from the sightline is marked in each panel. At 
z = 3 one arcminute corresponds to roughly l.3h co- 
moving Mpc for Qm = 0.3, Qa = 0.7, and so at closest 
approach these sightlines probe the intergalactic medium 
at ~ 0.3 to 1.3/i comoving Mpc from the galaxy. The 
shaded curves show the Lyman-a forest transmissivity. 
The symbol appears next to the impact parameter in 
each panel if Lyman-/? absorption from gas at higher red- 
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Fig. 9. — The spatial association of Lyman-break galaxies and 
damped Lyman-a systems. Square points show the mean over- 
density of Lyman-break galaxies in cylinders of constant depth 
Az = 0.025 (~ 17k- 1 comoving Mpc for n A/ = 0.3, fi A = 0.7) 
and variable radius A9 surrounding the four damped systems in 
our survey. 250" corresponds to roughly 5.4/i -1 comoving Mpc in 
the same cosmology. Circular points show the mean overdensity of 
Lyman-break galaxies in similar cylinders centered on other Lyman- 
break galaxies. A 5" offset was added to the true abscissae of the 
DLA points. All error bars assume Poisson statistics. Refer to left 
axis for scale. The curved line shows how frequently randomly cho- 
sen Lyman-break galaxies with similar redshifts to the two DLAs 
in our primary sample have more galaxy neighbors than the DLAs. 
Refer to right axis. Damped systems do not appear to cluster as 
strongly as Lyman-break galaxies. 



shifts could have affected the appearance of the Lyman-a 
forest at the galaxy's redshift. The horizontal line marks 
the mean transmissivity at the galaxy's redshift, estimated 
as described in the following paragraph. Circles mark the 
locations of detectable CIV absorption. The size of each 
circle is related to the CIV column density; a tripling of 
a circle's area corresponds to a factor of ten increase in 
column density. Due to significant gaps in our cchcllc 
spectrum of Q0201, we did not attempt to make a cat- 
alog of the CIV systems in this field. Our low quality 
spectrum of Q1422+2309b did not allow us to detect a 
significant number of CIV systems; panels for this QSO 
show the locations of CIV absorption in Q1422+2309 it- 
self, a QSO which is ~ 1' away. The short and tall vertical 
bars mark the observed redshifts of interstellar absorption 
and Lyman-a emission, respectively, in each galaxy's spec- 
trum. Wide shaded boxes mark the la confidence interval 
on each galaxy's systemic redshift, estimated from the ap- 
propriate equation among 2, 4, and 6. 

The values of the mean intergalactic Lyman-a transmis- 
sivity came from the following formulae. When the forest 
was not contaminated by Lyman-/? absorption, its mean 
transmissivity was taken to be 



f(z) = 0.676 - 0.2200 - 3), 



(8) 
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Fig. 10. — The distribution of neutral hydrogen and CIV absorption along the segments of the QSO spectra that pass closest to a Lyman- 
break galaxy. The shaded curve shows the Lyman-a forest. The horizontal line marks the mean transmissivity at this redshift in the QSO's 
spectrum. Circles mark the redshifts of detectable CIV absorption. Larger circles correspond to larger CIV column densities; a tripling in the 
circle's area corresponds to a factor of 10 increase in column density. Numerical values for the HI and CIV column densities of the absorbing 
gas near to one of these galaxies can be found in the last entry of table 2. The wide vertical region shows our estimated redshift (±1<t) for 
each galaxy. Narrower vertical bars mark the redshifts of Lyman-ct (tall bar) or interstellar absorption (short bar) in each galaxy's spectrum. 
The distance from each galaxy to the QSO sightline is indicated. 10" corresponds to roughly 200h~ 1 comoving kpc. A /3 next to the distance 
indicates that the Lyman-ct forest at this redshift may be contaminated by Lyman-/3 (or higher) absorption from gas at larger redshifts. 
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Table 2 
Damped Ly-o systems [+1] 



QSO 


2 


log(AWcm- 2 ) 


log(7V C jv/cm 2 ) 




ATex P b 


Q0000-2620 


3.3902 


21.3 


14.7 


1 


0.67 


Q0201+1120 


3.3864 


21.3 


13.9 





0.45 


Q0933+2845 


3.2352 


20.3 


12.8 


1 


2.23 


SSA22D13 


2.9408 


20.7 


13.1 





2.61 


SSA22D13 C 


2.7417 


15.1 


14.4 







a Obscrvcd number of Lyman-brcak galaxies with AO < 265 , Az < 0.0125 

b Expected number of Lyman-break galaxies if the DLA-LBG cross-correlation function 
were the same as the LBG-LBG correlation function 

c Not a damped system. This is the gas at a distance A9 — 17" (90/i _1 proper kpc), 
Az — 0.003 from a Lyman-brcak galaxy in SSA22 (figure 10). Its HI and CIV column 
densities arc listed for comparison. 



a fit to the relationship between mean transmissivity / and 
redshift presented by McDonald et al. (2000). The mean 
transmissivity of the forest contaminated by Lyman-/? and 
higher lines is roughly independent of redshift because the 
increased absorption in the blue due to high Lyman-series 
lines is largely canceled out by the gradual thinning of the 
forest towards lower redshifts. The mean transmissivity in 
this case was found to roughly obey the formula 

/ = 0.633 -0.40(2 QSO - 3.5) (9) 

where zqso is the QSO's redshift. These formulae are 
reasonable approximations only at the redshifts z ~ 3 that 
are of interest to us; they should not be used at other 
redshifts. 

4.1. A lack of HI near Lyman-break galaxies 

We would like to draw readers' attention to an interest- 
ing aspect of figure 10. Although the intergalactic medium 
within ~ 10/i _1 comoving Mpc of galaxy overdensities ap- 
pears to contain large amounts of neutral hydrogen (§3.1), 
often little neutral hydrogen is observed along the small 
segments of the QSO sightline that pass within ~ 0.5/i _1 
comoving Mpc of a Lyman-break galaxy. This is illus- 
trated more clearly in figure 11, which shows the mean 
Lyman-a transmissivity of the intergalactic medium as a 
function of apparent comoving distance from a Lyman- 
break galaxy 6 . We discarded parts of the QSO spectra 
that were contaminated by Lyman-/? absorption from gas 
at higher redshifts or by damped Lyman-a systems, and 
normalized the result so that the mean transmissivity at 
all radii would have been precisely 0.67 if the galaxies in 
our surveyed volumes were randomly distributed and in- 
finitely numerous. Rough error bars were calculated by 
generating a large ensemble of fake data sets, each con- 
taining the same QSO spectra and same galaxies as our 

6 More precisely, the figure shows (1 + f(r))/, where / = 0.67 is 
approximately the mean transmissivity at z = 3 and £ is an annu- 
lar average of the two-dimensional galaxy-flux correlation function 
£(rg,r z ) calculated as described in § 5 below. The annular average 
was weighted by £(rg,r z ), the total path length of our QSO spectra 
at a distance rg , r z from any galaxy in our sample. Scaling from the 
correlation function helps reduce systematic errors from the geome- 
try of our survey and from the change in mean transmissivity with 
redshift. 



actual sample, but with each galaxy's redshift modified by 
a Gaussian deviate with a z = 0.075 (~ 50/i _1 comoving 
Mpc for f^M = 0.3, Oa = 0.7). The error bars in the figure 
show the 1(7 scatter about the mean among these fake data 
sets. They may be underestimates of the true uncertainty, 
as discussed below. 

Figure 11 shows again the result discussed above (see 
figures 5 and 6): on scales of ~ 1 to ~ 5ft- -1 comoving 
Mpc Lyman-break galaxies are associated with an excess 
of neutral hydrogen. But on the smallest scales the trend 
appears to reverse. Little neutral hydrogen is found within 
<~ 0.5/i _1 comoving Mpc of the galaxies. Figure 12 shows 
that the change in mean transmissivity with distance de- 
rives from spatial variations in the relative proportion of 
lightly and heavily obscured pixels in the Lyman-a for- 
est spectra: few pixels with transmissivities less than 0.5 
are found within ~ lh^ 1 comoving Mpc of a Lyman- 
break galaxy, while a significant excess of saturated pixels 
with transmissivity ~ are found within ~ Ahr x Mpc of 
Lyman-break galaxies. 

The apparent lack of HI near galaxies is particularly 
interesting because it seems inconsistent with currently 
favored numerical models of the intergalactic medium at 
high redshift. In these models the density of baryons 
closely traces the density of dark matter and the p 2 de- 
pendence of the volumetric recombination rate causes the 
density of neutral hydrogen to be highest where the den- 
sity of dark matter is high. But galaxies also form where 
the density of dark matter is high, and as a result galaxies 
in the simulations tend to be surrounded by large amounts 
of neutral hydrogen. In the simplest models the density of 
HI increases rapidly at small galactocentric radii. This is 
illustrated by the curved lines in figure 11, which show the 
mean transmissivity around galaxies with baryonic mass 
M b > 10/i _1 M Q in the M = 0.3, n A = 0.7 smoothed- 
particle hydrodynamical (SPH) simulation of Croft et al. 
(2002; taken from their figure 9a). 

The differences between the simulation results and our 
data are not hard to discern. Rather than a continued in- 
crease in the HI density at smaller radii, we find that the 
HI density appears to level off at radii r 4/i~ 1 comoving 
Mpc and then decrease at the smallest radii r £ 0.5/i _1 
Mpc. Although these simulations included many physi- 
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2 4 6 8 
r / h _1 Mpc 

Fig. 11. — Mean intergalactic Lyman-o transmissivity as a func- 
tion of comoving distance from Lyman-break galaxies. Points with 
error bars mark our measurements; refer to the left axis for the 
scale. If galaxies were distributed randomly with respect to the 
Lyman-Q forest, we would have measured (flux/cont) ~ 0.67 at 
each radius (horizontal line). Instead it appears that Lyman-break 
galaxies are associated with neutral hydrogen overdensities (i.e., flux 
decrements) on scales extending to several Mpc, and with HI under- 
densities on the smallest spatial scales. This behavior, especially on 
small scales, contrasts with predictions from numerical smoothed- 
particle hydrodynamic simulations of ACDM universes where galax- 
ies have little impact on surrounding intergalactic matter. In these 
simulations the mean transmissivity drops increasingly rapidly near 
galaxies. Examples for different assumed galaxy redshift errors (as 
labeled, in km s _1 ) are shown with curved lines (data from figure 
9a of Croft et al. 2002, and scaled arbitrarily in y to match roughly 
our data on large spatial scales). The number of galaxies in our 
sample within each radius is shown by the arced staircase (refer to 
right axis). 

cal processes — gravitational and hydrodynamical interac- 
tions, radiative cooling, and radiative heating due to a 
uniform ionized background — they did not include star 
formation and supernova feedback in a way that allowed 
galaxies to have much influence on the nearby intergalac- 
tic medium. It is tempting to conclude from figure 11 that 
the actual impact of galaxies on their surroundings is con- 
siderably larger than in the simulations shown. Much of 
the rest of this section will consider the idea in more de- 
tail (see also Croft et al. 2002). But first we would like to 
discuss the statistical significance of the result. 

Our measurement of the mean transmissivity very close 
to galaxies ( ^ lft," 1 comoving Mpc) should be viewed with 
some skepticism. Recall the large and relatively uncertain 
offsets that were required to estimate galaxies' systemic 
redshifts from the redshifts of the absorption and emis- 
sion lines in their optical spectra (§ 2.2.2). One need not 
contemplate figure 10 for long to realize that even minor 
changes to our estimated systemic redshifts could dras- 
tically alter the inferred Lyman-a transmissivity of the 
intergalactic medium close to Lyman-break galaxies. A 
few judiciously applied redshift adjustments of 8z ~ 0.003 



T 
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Fig. 12. — Histogram of pixel transmissivities at different dis- 
tances from Lyman-break galaxies. The shape of figure 11 de- 
rives from a relative lack of low transmissivity pixels within r ~ 
0.5/i" 1 comoving Mpc of Lyman-break galaxies and an excess with 
1 T 5/i _1 comoving Mpc. 

could easily erase the inflection from figure 11, for example, 
and 8z = 0.003 is hardly a large adjustment. It is small 
compared to the range of redshifts seen in most Lyman- 
break galaxy spectra. It barely exceeds our optimistic es- 
timates of the lcr redshift uncertainty from § 2.2.2. 

One way to assess how badly redshift errors might have 
compromised our estimate of the mean intergalactic trans- 
missivity at different distances from Lyman-break galax- 
ies is to change each of our redshifts by an amount sim- 
ilar to its uncertainty, then recalculate the implied mean 
transmissivity. Figure 13 shows the result. Circles mark 
the average estimated transmissivity at each distance af- 
ter adding a Gaussian deviate with <j z = 0.002 to each 
of our redshifts; vertical error bars show the lcr range ob- 
served when we repeated the exercise many times. Squares 
show the result when the standard deviation of the Gaus- 
sian deviate is increased to a z = 0.004. Since a z — 0.002 
is roughly the redshift uncertainty that follows from the 
analysis of § 2.2.2, we conclude that the true redshifts of 
our galaxies could lie anywhere within our ~ 2a confidence 
intervals without much affecting our conclusion that the 
Lyman-Q! opacity of the intergalactic medium decreases 
near galaxies. 

The small number of galaxies close to the QSO sight- 
lines may be more worrying than our redshift uncertain- 
ties. Only three galaxies contribute to the measurement 
of mean transmissivity at r < 0.5ft." 1 comoving Mpc that 
is shown in figure 11. The situation is not quite as dire 
as the figure suggests, because three additional galaxies in 
our sample lie within r = 0.5ft." 1 comoving Mpc of a QSO 
sight-line. See figure 10. They were excluded from the cal- 
culation shown in figure 11, along with many other galax- 
ies at larger impact parameters, because their low redshifts 
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Fig. 13. — The result of figure 11 recalculated after modifying each 
galaxy's redshift by an amount similar to its uncertainty. Numer- 
ous fake data sets were generated by adding to each galaxy's redshift 
a Gaussian deviate with the standard deviation a z shown. Points 
show the average value of the transmissivity among the fake sets; 
error bars show the la spread. The error bars for <r z = 0.004, sup- 
pressed for clarity, are similar to those for a z = 0.002. The smooth 
curves in the background are a spline fit to the data of figure 11 
and their +lcr upper envelope. If <r z were equal to the uncertainty 
in each galaxy's redshift, then each of the fake data sets would be 
roughly as compatible with our observations as the actual data set, 
and any large differences between the mean transmissivities in the 
fake and real sets would suggest that the mean transmissivity in the 
real set may have been compromised by an unusual combination of 
redshift errors. 

corresponded to parts of their QSO's Lyman-a forest that 
were contaminated by Lyman-/? absorption and correct- 
ing for Lyman-/? was a challenge that we chose to dodge. 
But since Lyman-/3 absorption from gas at higher redshifts 
can only decrease the transmissivity in the Lyman-a for- 
est, there is no reason not to use these galaxies in a con- 
servative search for increases in transmissivity near galax- 
ies. The mean transmissivities within 0.5ft -1 comoving 
Mpc of the two galaxies in figure 10 at impact param- 
eter 14" (~ 0.28k- 1 Mpc for Q M = 0.3, fi A =0.7) are 
(flux/cont) = 0.92, 0.97; the mean transmissivity within 
0.5ft -1 comoving Mpc of the galaxy at impact parameter 
22" (~ 0.46ft -1 Mpc) is (flux/cont) = 0.04. Weighting 
by the length of QSO spectrum that passes within 0.5ft -1 
comoving Mpc of each galaxy, this implies a mean trans- 
missivity of 0.78 for the three — an answer encouragingly 
consistent with the result of figure 11 that was derived 
from three different galaxies. 

Still, six galaxies with impact parameter rg < 0.5ft. -1 
comoving Mpc is not a large sample. The evidence for 
a decrease in HI density within 0.5ft -1 comoving Mpc of 
Lyman-break galaxies has a significance of less than 2cr; 
the difference between our observations and the simula- 
tions in the same bin would have a significance of about 
2(7 if we pessimistically assumed that the accuracy of our 



redshifts was only a z = 300kms - . One might argue that 
even without the data at rg < 0.5ft -1 our measurements 
would not resemble the simulations perfectly, but it may be 
more fruitful to consider instead what physical processes 
could lead to a lack of neutral hydrogen within ~ 0.5ft -1 
comoving Mpc of a galaxy and look for other signatures of 
their existence. 

4.2. Physical origin 

4.2.1. Ionizing radiation 

Reduced Lyman-a absorption is observed in the inter- 
galactic medium surrounding high-redshift QSOs, a fact 
that presumably reflects a decrease in the hydrogen neutral 
fraction in regions where a quasar's radiation overwhelms 
the ambient ionizing field (see, e.g., Weymann, Carswell, 
& Smith 1981; Murdoch et al. 1986; Bajtlik, Duncan, & 
Ostriker 1988; Scott et al. 2000). Could a similar physical 
cause account for the lack of HI absorption in the vicin- 
ity of Lyman-break galaxies? A rough argument suggests 
that this is unlikely to be the case. Consider the mean 
transmissivity within 0.5ft." 1 comoving Mpc of an LBG, 
(flux/cont) £ 0.8, which is substantially higher than the 
mean transmissivity (flux/cont) ~ 0.5 we might expect in 
the absence of a proximity effect (Fig. 11). In the z = 3 
output of the SPH simulation described in White, Hern- 
quist, & Springcl (2001), an increase in the ionizing back- 
ground by a factor of ~ 8 is required to increase the mean 
transmissivity of a random section of the Lyman-a forest 
from 0.5 to 0.8, and so we might guess that the ionizing flux 
within 0.5ft -1 Mpc of a Lyman-break galaxy would have 
to be ~ 8 times higher than its universal mean if ionizing 
radiation from the galaxies were responsible for the ob- 
served proximity effect. The Lyman-break galaxies in our 
sample could not easily produce an ionizing flux so large, 
as the following crude calculation shows. If our sample 
can be approximated as a uniform population of galaxies 
with ionizing luminosity L (energy time -1 frequency -1 ) 
and comoving number density p that produces a fraction 
/b g of the total ionizing background J* ot (energy time -1 
frequency -1 area -1 steradian -1 ) at z ~ 3, then the con- 
tribution to the ionizing background from a single galaxy 
at radius r is 

J 1 = L/(4nrf (10) 

while the net contribution to the ionizing background from 
the population as a whole is roughly 

J^ G ~Lp(d/c){c/^)^f hg Ji ot (11) 

where d is the average distance traveled by an ionizing 
photon before it is absorbed. The ionizing flux will there- 
fore be more than n times higher than its universal average 
jtot within a radius 

r„~ [47rp(n-l)d// bg ] -1 / 2 (12) 

of a Lyman-break galaxy. Assuming an fijvf = 0.3, = 
0.7 cosmology, and substituting d ~ 117ft -1 Mpc (i.e., 
Az ~ 0.18) for the comoving effective absorption distance 
(Madau, Haardt, & Rees 1999) and p ~ 4 x 10 -3 ft 3 Mpc -3 
for the comoving number density of Lyman-break galaxies 
with magnitude 1Z < 25.5, we find 

r n — J fb s /(n - 1) x 0.4ft -1 comoving Mpc. (13) 
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In the extreme case where all of the hydrogen-ionizing 
background is produced by Lyman-break galaxies with 
K < 27 (see, e.g., Steidel, Pettini, & Adelberger 2001), 
only ~ 50% of it will be produced by Lyman-break galaxies 
with TZ < 25.5, provided that the ratio / A (1500A)// A (900A) 
is independent of luminosity and that the rest-frame 1500 A 
luminosity distribution of Lyman-break galaxies is a Schechter 
function with knee 1Z* = 24.5 and faint-end slope a = 
— 1.6 (Adelberger & Steidel 2000), and so we can take 
0.5 as a rough upper limit on /b g . This implies an up- 
per limit of r max ;$ O.l/i -1 comoving Mpc for the radius 
within which ionizing radiation from Lyman-break galax- 
ies could raise the ionizing flux to ~ 8 times its universal 
value and increase the mean transmissivity from 0.5 to 0.8. 
The argument is crude in a number of ways, but the ob- 
served radius with (flux/cont) ~ 0.8 is ~ 5 times larger 
than r max and r max does not depend strongly on any of its 
parameters /, n, d, or p. The change in mean transmis- 
sivity near Lyman-break galaxies appears unlikely to be 
produced solely by the Lyman-continuum radiation they 
emit. 

4.2.2. Galactic superwinds 

Could winds driven by the combined force of numer- 
ous supernova explosions within Lyman-break galaxies be 
responsible instead? Strong winds with velocities exceed- 
ing the escape velocity have been observed around a large 
fraction of starburst galaxies in the local universe (e.g. 
Hcckman et al. 2000). Similar outflows are seen in Lyman- 
break galaxies as well (§ 2.2.2; Pettini et al. 2001; Pettini 
et al. 2002). Though the typical velocity of a Lyman-break 
galaxy's interstellar lines relative to its nebular lines is only 
~ 150km s _ , the velocity exceeds 300km s -1 for roughly 
one third of the galaxies in the sample of § 2.2.2. Moreover 
the interstellar material within an individual Lyman-break 
galaxy does not all flow outward at a single rate. Instead 
a reasonable fraction of the absorbing interstellar material 
has been accelerated to velocities significantly higher the 
mean interstellar velocity. Pettini et al. (2002) found ab- 
sorbing interstellar gas with blueshifts of up to <~ 750 km 
s _1 in MS1512-cB58, for example, a galaxy with a mean 
interstellar blueshift of ~ 250 km s _1 . The large veloc- 
ity widths of most Lyman-break galaxies (a v ~ 180-320 
km s _1 ; Steidel et al. 1996) show that this situation must 
be common. The typical Lyman-break galaxy apparently 
contains absorbing material flowing outwards with a range 
of velocities £ v ;$ 600 km s^ 1 . 

In local galaxies, where similar outflows are observed, a 
distribution of velocities from to v max is generally inter- 
preted as evidence that winds from supernovae are strip- 
ping material from interstellar clouds and accelerating it 
to the velocity v maK (e.g. Heckman et al. 2000). If Lyman- 
break galaxies' absorption spectra were interpreted in the 
same way, one would conclude that most of the absorb- 
ing material will eventually be accelerated to a velocity 
of ~ 600 km s _1 . In Adelberger (2003) we consider the 
implications of 600 km s _1 outflows from Lyman-break 
galaxies in some detail. We argue that ~ 600 km s _1 
winds should be able to escape potential wells as deep 
as those that presumably surround Lyman-break galaxies, 
show that the release of supernova energy implied by the 
galaxies' star-formation rates and stellar masses should be 
sufficient to set massive ~ 600 km s -1 winds in motion, 



and find that these winds would likely travel a distance 
comparable the observed radius of the galaxy proximity ef- 
fect during the typical <~ 300Myr star-formation time-scale 
of Lyman-break galaxies. The upshot is that these winds 
may plausibly have driven intergalactic material from a 
cavity of radius r ^ 0.5 comoving Mpc surrounding each 
galaxy, producing the observed lack of neutral hydrogen 
near the galaxies. In the remainder of this section we will 
look for other evidence that this might be the case. 

4.3. Metals 

If Lyman-break galaxies drive winds into their surround- 
ings, one might expect to see an increase in the number 
density of metal-line absorption systems near them. The 
discussion above and in Adelberger (2003) suggests that 
any material ejected by a Lyman-break galaxy would be 
likely to lie within ~ 0.5 comoving Mpc and to have a 
rcdshift difference ,$ 600 km s _1 . These numbers arc un- 
certain. Aside from the crude spherical model of Adel- 
berger (2003) and its poorly constrained parameters (e.g., 
the fraction of supernova energy that is imparted to the 
winds), there are the complications that winds from dif- 
ferent galaxies will have advanced to different radii, that 
winds slow as they advance, and that much of a wind's 
velocity may be directed perpendicular to the sightlinc. 
Nevertheless the numbers above tell us roughly where we 
should search for metals that might have been ejected from 
Lyman-break galaxies. 

The QSO absorption spectra available to us clearly can- 
not provide perfect information about the distribution of 
metals in the intergalactic medium. In most cases we can 
detect only the absorption due to CIV, and this is what we 
will be forced to rely on in the analysis below. Although 
we will often assume glibly that CIV absorption systems 
are found where the metallicity is highest, we should re- 
mind readers at the outset that the density of CIV is not 
related in a trivial way to the density of metals: the frac- 
tion of carbon that is in the third ionization state can be 
strongly affected by the local gas density and by the in- 
tensity and shape of the ambient radiation field. Still in 
at least some cases we can be reasonably confident that 
the presence of CIV absorption near Lyman-break galax- 
ies signals the presence of significantly enriched gas. Our 
confidence derives from Figure 14, which shows the ratio 
V = n civ/ n m in ionization equilibrium for a (76% H, 24% 
He) gas with solar carbon abundance. This ratio depends 
sensitively on the possible presence of a break at 4Ryd in 
the background radiation field, since reducing the radia- 
tion at hv > 4Ryd makes it harder for CIV to be ionized 
to CV but does not much affect the density of HI. Never- 
theless the plot suggests that most intergalactic gas would 
likely have 0.1 £ rj £ 3, though the ratio could be driven as 
high as <~ 6 if the temperature and density were carefully 
chosen and if the intensity of the background radiation de- 
creased sharply at hv > 4Ryd. If the actual abundance 
of carbon in the gas were [C/H] <~ —2.5, or <~ one 300th 
solar, characteristic of the intergalactic medium at z ~ 3 
(e.g., Dave et al. 1998), one would expect r\ typically to 
be ~ 10 -3 -10 -2 and never to exceed Jj™|fs ~ 2 x 10~ 2 . 
The fact that the observed ratio new I n m in gas close to 
Lyman-break galaxies sometimes exceeds by an or ~ 

der of magnitude (see, e.g., the entry in table 2 for the 
z = 2.7417 system along the SSA22D13 sight-line) shows 
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that the metallicity of this gas must be significantly higher 
than the mean [C/H] ~ —2.5. 




Fig. 14. — The ratio n,crv/ n Hl for a gas with solar carbon abun- 
dance in ionization equilibrium with a uniform radiation field. Den- 
sities are expressed in terms of (pi), the mean proper baryon density 
at 2 = 3. Thin dashed lines are appropriate for the background ra- 
diation field J v = 1.2(/w/13.6eV) _1 - 8 X 1CT 21 erg s" 1 cm" 2 Hz" 1 
sr _1 ; thick solid lines for the same background except with a factor 
of 10 reduction for hu > 4 Ryd. The shaded regions show the tem- 
perature and density of 68%, 90%, and 95% of the baryonic mass 
at z = 3 in the SPH simulation described in White et al. 2001. The 
upper limit nciv/ n Hl ~ 6 for solar carbon abundance shows that 
gas at the mean intergalactic metallicity [C/H] ~ —2.5 should not 
have a ratio nciv/'iHl greater than ~ 0.02. The measured ratio 
in gas near to Lyman-break galaxies can exceed this maximum by 
a large amount, showing that the gas is enriched well beyond the 
intergalactic mean. 

We constructed catalogs of the CIV absorption sys- 
tems in each QSO's spectrum by scanning by eye for dou- 
ble absorption lines with the correct spacing and relative 
strengths for the redshifted CIVA1549 doublet. Rough col- 
umn densities for each system were estimated with the 
equation 

peak Ar ^e 2 , ^CIV MA s 

t£ = N cw few t=, (14) 

m e c <7 v v2ir 

for the peak optical depth in terms of CIV's column den- 
sity New, oscillator strength few, and (Gaussian) veloc- 
ity dispersion a v , after fitting e~ TA , with t\ a Gaussian, to 
both components of the doublet. CIV systems with similar 
redshifts were treated as independent systems if their ve- 
locity differences exceeded twice the quadrature sum of 
their velocity full widths. In the case of Q1422+2309, 
our CIV catalog agrees well with the more carefully con- 
structed catalog of Ellison et al. (2000), though it differs 
somewhat in the arbitrary grouping of neighboring CIV 
systems into single absorption complexes. 

Ten galaxies in our primary sample lie within AO = 35" 
(~ 0.75/i _1 comoving Mpc for Qm = 0.3, £!a = 0.7) of the 
QSO sightline. In nine of the ten cases there is detectable 
CIV absorption in the QSO spectrum within 600 km s _1 



of the galaxy's redshift. The lone exception lies near the 
sightline to SSA22D13, a faint QSO whose moderate res- 
olution spectrum reveals only the strongest CIV systems. 
In three cases, shown in figure 15, the CIV absorption is 
especially strong and absorption due to many species is 
evident. The galaxies shown in the figure, SSA22-MD36, 
Q2233-MD34, and SSA22-C35, lie 17", 31", and 33", re- 
spectively, from their background QSO's sightline; of all 
the galaxies in our sample, their angular separations to the 
sightline are the 3rd, 6th, and 8th closest. The metal-line 
system close to Q2233-MD34 has the largest CIV column 
density of any in our QSO spectra; the system close to 
SSA22-MD36 has the third largest. Does this correspon- 
dence of metal-line systems with galaxies near the sight- 
line imply some physical connection between the two, or 
could it be a coincidence? Should we be surprised that 2 
of the 3 strongest CIV systems in our sample lie within 
Az — 600km s _1 and AO = 35" of a Lyman-break galaxy? 

To address this question, we need some estimate of the 
observed overdensity of Lyman-break galaxies near CIV 
systems relative to what would be expected if they were 
distributed independently. We can estimate the number 
of Lyman-break galaxies that would lie so close to the 3 
strongest CIV systems if galaxies and metal systems were 
independently distributed by generating a large ensemble 
of fake data sets where each galaxy within AO — 35" of 
the sightline is assigned a redshift drawn at random from 
our selection function. See appendix B. Among these fake 
data sets, the mean number of galaxies that lie within 
Az — 600km s -1 of one of the three strongest CIV sys- 
tems is 0.049. Since the observed number _is 2, the implied 
overdensity of Lyman-break galaxies is £ gc ~ 40. Only 
1 time in 1000 will sampling from a Poisson distribution 
with a true mean of 0.049 yield a value of > 2, so we 
can conclude with confidence that strong CIV systems and 
Lyman-break galaxies are not distributed independently. 

But this does not show that the observed metals were 
driven out of the galaxy by a superwind. The number 
density of Lyman-break galaxies in cells centered on other 
Lyman-break galaxies is much higher than the number 
density in randomly placed cells, for example, yet few be- 
lieve that one galaxy ejected the next. Is it possible that 
galaxies and CIV-systems tend to fall near each other for 
the same reason that galaxies fall near each other, because 
they all trace the same large-scale structure? A simple way 
to address this issue is to use the observation of Quashnock 
& Vanden Berk (1998) that the correlation function of 
CIV systems at z ~ 3 is similar to the correlation function 
of Lyman-break galaxies, a fact compatible with the idea 
that CIV systems and Lyman-break galaxies are similar 
objects. Suppose CIV systems and Lyman-break galaxies 
were in fact the same objects, but the CIV associated with 
each galaxy extended only to radii small compared to the 
smallest galaxy-QSO impact parameter in our sample, so 
that the detected CIV absorption could never have been 
produced by a galaxy we observed. In this case we would 
expect the overdensity of galaxies within AO = 35" and 
Az = 600km s _1 of a CIV system to be roughly equal to 
the mean overdensity of Lyman-break galaxies within the 
same distance of another Lyman-break galaxy. Among the 
697 Lyman-break galaxies with the most certain redshifts 
in 13 fields of our survey, 31 unique pairs have a separa- 
tion AO < 35" and Az < 600km s -1 while 3.92 would have 
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Fig. 15. — The Lyman-ct and metal absorption along three QSO sightlines than pass within (from left to right) 17", 31", and 33" (90, 180, 
170 proper h^ 1 kpc) of known z ~ 3 Lyman-break galaxies. Sections of the QSO spectrum corresponding to different species were extracted 
and aligned. The bottom spectral segment shows the HI absorption; upper segments show the associated metal-line absorption as indicated. 
Bluer lines are on the bottom. Redshifts with detected metal lines are marked by arrows. Absorption in the extracted spectral segments can 
be produced by other metal lines at other redshifts, and (in some cases) intergalactic Lyman-o at higher redshift, and the observed absorption 
can believably be associated with metals at the galaxy's redshift only when it is observed in more than one line. Galaxy locations are marked 
with vertical bars, as in Fig. 5; the galaxy closest to the sightline lies in the middle of the extracted spectrum. The velocity offsets between 
the absorption in the QSO spectrum and the galaxy itself (~ 100 — 600 km s — 1 ) are comparable to the measured outflow velocities of the 
galaxies (e.g. Pettini et al. 2001). 
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been expected if the galaxies were distributed uniformly. 
The implied galaxy-galaxy overdensity £ gg — 6.9 is signifi- 
cantly smaller than the measured galaxy-CIV overdensity 
£ gc = 40. This suggests the spatial coincidence of Lyman- 
break galaxies and CIV systems may be too strong to be 
explained by arguing that both trace the same large scale 
structure, that both tend to reside in the same clusters 
and shun the same voids. A more direct connection be- 
tween the observed CIV systems and galaxies appears to 
be required. 

This sort of argument can be formalized with a statis- 
tical inequality derived in appendix A. If / is a discrete 
(i.e. Poisson) realization of the continuous function /', 
g is a discrete realization of the continuous function g', 
and £/ 9 denotes the mean value within some volume of 
the cross-correlation function £/ g between / and g, then 
£/ g can exceed {iff^gg) 1 ^ 2 only if the locations of particles 
in g are influenced by where particles happen to lie in / 
or vice versa. This would be the case if (for example) g' 
were equal to /' and the particles in g were a random sub- 
set of those in /, or if each particle in / were surrounded 
by a cloud of particles in g. The statement holds pro- 
vided the correlation functions are volume averaged with 
one of a class of 3D weighting functions that includes the 
Gaussian. If we could show that the Gaussian-weighted 
cross-correlation function of galaxies and CIV systems £ gc 
exceeded the root-productof the Gaussian- weighted auto- 
correlation functions (CggCcc) 1 ^ 2 , we would have evidence 
that the number density of galaxies in a random volume 
directly affected (or was directly affected by) the number 
density of CIV systems in the same volume. 

The mean overdensity of Lyman-break galaxies in Gaus- 
sian ellipsoids with a z = 350km s -1 and ag = 20" that are 
centered on the strongest three CIV systems in our sample 
is £ S c = 21. The mean overdensity of Lyman-break galax- 
ies in similar ellipsoids centered on other Lyman-break 
galaxies is £ gg = 7. The mean overdensity of CIV systems 
in similar ellipsoids centered on other CIV systems will be 
comparable, £ cc ~ 7, if Quashnock & Vanden Berk's (1998) 
estimate of the CIV correlation function remains accurate 
on small spatial scales. £ gc evidently exceeds {iggicc) 1 ^ 2 by 
a large amount. This may be an aberration. The sample 
is small. But if the result holds in a much larger sample of 
galaxies and CIV systems, we would have solid evidence 
for a physical link between galaxies and the strong CIV 
absorption observed ~ 0.5ft -1 comoving Mpc away. 

In any case, the connection between Lyman-break galax- 
ies and CIV systems clearly weakens as the column density 
of the CIV systems is reduced. If we take the 20 strongest 
CIV systems, for example, rather than the 3 strongest, 
the mean overdensity of Lyman-break galaxies in simi- 
lar Gaussian ellipsoids surrounding the CIV systems is 
Cgc = 9.0. This still exceeds £ gg — 7 ~ £ cc , but proba- 
bly not by a significant amount. The mean overdensity in 
Gaussian ellipsoids surrounding every CIV system in our 
sample is a paltry £ gc = 2.8, showing that there is little 
evidence that the weakest CIV systems are directly asso- 
ciated with the observed galaxies. The dependence of the 
cross-correlation strength £ gc on column-density is shown 
in figure 16. Also shown is £ gg and its uncertainty. Cylin- 
drical cells with AO < 35" and Az < 600km s -1 were used 
when calculating the mean overdensities £, rather than the 



ellipsoidal Gaussians discussed above. This simplified the 
calculation of the uncertainties in £, which we optimisti- 
cally took to be Poisson, but it means that £ gc does not 
formally need to satisfy inequality A7. But similar con- 
clusions about the column densities where A7 is and is 
not satisfied would follow had we averaged in Gaussian 
ellipsoids instead. 



80 



60 



\J° 40 



20 



E 
10 



1 1 



10 



12 







v 



CTV 



13 



cm" 



10 



14 



10 



15 



Fig. 16. — The observed overdensity of CIV systems within 
Az = 600 km s _1 and A8 = 35" of a Lyman-break galaxy as a 
function of CIV column density. The spatial association of Lyman- 
break galaxies and CIV systems at these separations grows rapidly 
stronger as the CIV column density increases. The shaded horizon- 
tal region shows the overdensity of Lyman-break galaxies within a 
similar distance of another Lyman-break galaxy. As explained in 
§ 4.3, the presence of CIV points above this region suggests strong 
CIV systems and Lyman-break galaxies may be identical objects. 

It is interesting to address the association of galaxies 
and metals in a slightly different way. What fraction 
of detectable CIV absorption is produced by gas within 
A9 < 35", Az < 600km s -1 of a Lyman-break galaxy? Be- 
cause only a small fraction of the Lyman-break galaxies 7 in 
any field at z ~ 3 are included in our spectroscopic sample, 
we would not expect most CIV systems to lie near a galaxy 
in the sample even if every CIV system had a Lyman- 
break galaxy nearby. Limited observing time allowed us 
to obtain spectra for fewer than half of the photometrically 
detected Lyman-break galaxies in each field, for example, 
and Monte-Carlo simulations (Steidel et al. 1999) suggest 
that only ~ 50% of all Lyman-break galaxies are included 
in our photometric sample even at z = 3.0 where our se- 
lection is most efficient. We know our sample is seriously 
incomplete; but we can attempt to correct for this and esti- 
mate what fraction of CIV systems would have been found 
to lie within AO < 35", Az < 600km s" 1 of a Lyman-break 
galaxy if we had been able to measure the redshifts of ev- 
ery Lyman-break galaxy our fields. The result is shown 

7 i.e., the galaxies with optical magnitude 1Z ,$ 25.5 and dust red- 
dening < E(B - V) £ 0.5 that have a non-zero probability of sat- 
isfying our photometric selection criteria; see Steidel et al. (1999). 
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in figure 17. Circles mark the fraction /lbg of detected 
CIV systems with different column densities that lie within 
AO < 35", Az < 600km s -1 of a galaxy in our spectro- 
scopic sample. Stars mark the expected fraction if every 
CIV system lay within this distance of one (and only one) 
Lyman-break galaxy. Crosses mark the expected fraction 
if CIV systems and Lyman-break galaxies were distributed 
independently. At column densities A*civ ~ 10 135 cm -2 
the observed number of Lyman-break galaxies close to 
CIV systems significantly exceeds the number expected 
if only one Lyman-break galaxy lay within A9 < 35", 
Az < 600km s -1 of each CIV system. This apparently 
shows that the strongest CIV absorption is produced in gas 
with several Lyman-break galaxies nearby, an interesting 
observation for which we have no ready explanation. Con- 
fidence limits on the fraction of CIV systems that have 
at least one Lyman-break galaxy within AO < 35" and 
Az < 600km s 1 can be crudely derived by estimating the 
mean number of Lyman-break galaxies within this distance 
of a CIV system, N^bg, then using Poisson statistics to 
work out how frequently sampling from a distribution with 
this mean would yield one or more galaxies (1 — e~ AfLBG ). 
Averaging over the range of Albg compatible with the 
data leads to the rough confidence intervals shown in fig- 
ure 17. The data evidently suggest that the majority of 
CIV absorption with Vciv ~ 10 13 cm~ 2 is produced by gas 
that lies no farther than AO < 35" and Az < 600km s -1 
from a Lyman-break galaxy, i.e., by gas that could plausi- 
bly have been ejected from the galaxies in an outflow. 

How might weaker metal-line absorption systems with 
Vciv ~ 10 13 cm -2 be associated with Lyman-break galax- 
ies? This question can be straightforwardly addressed with 
results presented in § 5 below. The numerous weak CIV 
absorbers dominate the cross-correlation function of galax- 
ies and CIV systems, which (as we will show) is roughly a 
power-law of the form £ gc = (r / V gc )~ 7EC with r gc ~ 3.2/i _1 
comoving Mpc and 7 gc = 1.6. The mean number of 
Lyman-break galaxies within a distance r of a randomly 
chosen CIV system is therefore 



Vlbg - g7rr "lbg | 1 



3 r -7 EC 



(3 7gc)?"gc 



(15) 



where tilbg — 4 x 10 _3 /i 3 Mpc -3 is the comoving num- 
ber density of Lyman-break galaxies with magnitude 1Z < 
25.5. This shows that the numerous weak CIV systems 
will typically have ~ 1 Lyman-break galaxy within a co- 
moving distance of ~ 2Ah~ 1 Mpc. If they were ran- 
domly distributed, the nearest Lyman-break galaxy would 
lie ~ 3.9/i _1 Mpc away. Weak CIV systems also tend to 
lie close to Lyman-break galaxies, though not as close as 
their higher column-density counterparts. 

In summary, it appears that metals in the intergalactic 
medium are closely connected to the star-forming galaxies 
we observe. Most CIV systems lie within ~ 2.4/i _1 Mpc 
of a Lyman-break galaxy; higher column density systems 
tend to lie closer still; and the highest column density sys- 
tems are so strongly correlated with Lyman-break galaxies 
that we may be forced to conclude that they are the same 
objects, a result that is remarkable because the absorbing 
gas is typically ~ 0.5/i _1 comoving Mpc from the galaxies' 
stars. 

We are aware of two objections to the implied link 
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Fig. 17. — The fraction /lbg of detected CIV systems that lie 
within Az = 600 km s" 1 and A6 = 35" of a Lyman-break galaxy 
as a function of CIV column density. Circles mark the fraction 
that lie within this distance of a galaxy in our spectroscopic sample. 
Stars mark the expected fraction if each CIV system had one galaxy 
within this distance. Crosses mark the expected fraction if CIV 
systems and Lyman-break galaxies were independently distributed. 
The low value of /lbg f° r the stars reflects the severe incompleteness 
of our spectroscopic sample; simulations suggest that only ~ one 
fifth of Lyman-break galaxies with redshifts similar to those of the 
CIV systems would satisfy our photometric selection criteria and be 
included in our final spectroscopic catalogs. Error bars mark the 
estimated fraction of CIV systems that lie within Az = 600 km s — 1 
and A6* = 35" of a Lyman-break galaxy once our selection effects 
are corrected. 



between intergalactic metals and Lyman-break galaxies' 
winds. The large HI column densities of the metal-line 
systems significantly exceed naive expectations based on 
the high expected post-shock temperature of fast-moving 
winds (Adelberger 2003). The constant observed inter- 
galactic CIV density at redshifts 1.5 < z < 5.5 may im- 
ply that metal enrichment of the intergalactic medium 
occurred at z > 5.5 and not at the lower redshifts we 
have observed (Songaila 2001). Both objections are rea- 
sonable; neither is damning. Cooling due to a plausibly 
high metallicity in the outflows could account for the large 
observed HI column densities. Metals ejected by winds 
might later be driven back into collapsed objects by the 
continued action of gravitational instability (Adelberger 
2003); in this case the constant intergalactic CIV density 
at 1.5 < z < 5.5 would primarily reflect the constant co- 
moving star- formation density (e.g., Steidel et al. 1999) 
at similar redshifts. 

4.4. Interstellar absorption lines 

We conclude this section with a brief discussion of the 
gas that produces the blueshifted absorption lines in the 
spectra of Lyman-break galaxies. These absorption lines 
are often called "interstellar," because they resemble the 
absorption lines produced by interstellar material in local 
galaxies. But is it necessarily true that the absorbing ma- 
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terial lies between the galaxy's stars? Or might it instead 
lie far from the stars, perhaps at the front of a shock that 
has advanced ~ 0.5ft, -1 comoving Mpc? 

It is relatively easy to rule out the idea that the inter- 
galactic CIV absorption observed within ~ 0.5ft -1 Mpc 
of Lyman-break galaxies is produced by the same gas re- 
sponsible for the galaxies' own absorption lines. The inter- 
galactic equivalent widths are too small. This is illustrated 
by Figure 18, which compares the mean interstellar CIV 
absorption in a sample of ~ 800 Lyman-break galaxies 
(Shapley et al. 2003) to the CIV absorption produced by 
the intergalactic gas that lies ~ 0.35ft, -1 comoving Mpc 
from the Lyman-break galaxy SSA22-MD36. With a col- 
umn density ATciv — 10 4 (see table 2), this intergalac- 
tic CIV absorption is among the strongest in any of our 
QSO spectra. But even though its column density is ~ 2 
orders of magnitude larger than the typical column den- 
sity in our intergalactic sample, the resulting absorption 
equivalent width is far smaller than observed in the av- 
erage Lyman-break galaxy's spectrum. We have argued 
that winds from Lyman-break galaxies may reach comov- 
ing radii approaching ~ 0.5ft -1 Mpc, but we cannot claim 
that most of the outflowing gas responsible for the galax- 
ies' absorption lines has traveled so far. 

The lack of galaxies close to the QSO sightlines prevents 
us from using the QSO spectra to place tighter limits on 
the typical distance between Lyman-break galaxies' stars 
and interstellar gas. But a comparatively large number 
of Lyman-break galaxies lie near sightlines towards other 
Lyman-break galaxies at higher redshift. Since the spec- 
tra of the background galaxies are generally good enough 
to reveal the background galaxies' own CIV absorption, 
they should also be good enough to reveal CIV absorption 
from a foreground Lyman-break galaxy — if the foreground 
galaxy's CIV absorption were mainly produced by gas at 
large impact parameters. 17 Lyman-break galaxies in our 
complete sample have a background Lyman-break galaxy 
at an impact parameter of rg < 0.16ft -1 comoving Mpc. 
Among these 17 pairs, the smallest and mean impact pa- 
rameters are 0.07ft. -1 and 0.11ft. -1 comoving Mpc respec- 
tively. The pairs are generally chance alignments rather 
than physical associations; all but four have redshifts that 
differ by more than Az = 0.1 (~ 70ft. -1 comoving Mpc 
for FIm — 0.3, f?A = 0.7). Figure 19 shows the mean 
Lyman-a and CIV absorption at the redshift of the fore- 
ground galaxy that we observe in the spectra of the 17 
background galaxies. In calculating the mean absorption, 
we discarded parts of the spectra with A < 4000A, parts 
that were contaminated by telluric absorption or by resid- 
uals from bright sky lines, and parts that overlapped with 
any of the background galaxy's ten strongest absorption 
lines. Each spectrum was then multiplied by a constant 
to give it a mean flux of unity within ±5000kms -1 of the 
foreground galaxy's absorption line. Little absorption is 
apparent at the redshifts of the foreground galaxies. We 
conclude that the typical Lyman-break galaxy's interstel- 
lar absorption lines arise in gas that lies within ~ 25ft. -1 
proper kpc of its stars. This conclusion is consistent with 
the idea that the interstellar absorption is produced by 
pockets of cold gas entrained in a hot outflow that destroys 
them before they reach large radii (e.g., Klein, McKee, & 
Colella 1994 §9.2). It is equally consistent with the idea 
that Lyman-break galaxies' winds rarely propagate so far. 




-2000 2000 
Av / km s -1 

Fig. 18. — Intergalactic and interstellar CIV absorption. The 
thick line in the foreground shows the mean CIV absorption ob- 
served in the spectra of Lyman-break galaxies (Shapley et al. 2003, 
in preparation). The equivalent widths implied by the upper and 
lower edges of this line differ by ±3<r from the mean equivalent width 
reported by Shapley et al. The darker shaded region in the back- 
ground shows the CIV absorption produced by the intergalactic gas 
that lies ~ 0.35?i — 1 comoving Mpc from the Lyman-break galaxy 
SSA22-MD36; it is a small piece of our spectrum of a background 
QSO that is offset from SSA22-MD36 by 17". The QSO spectrum 
was smoothed by a Gaussian with <r = 200kms _1 to approximate 
the resolution of the galaxy spectrum. The intergalactic CIV ab- 
sorption system is among the strongest in our entire sample, yet its 
equivalent width is small compared to the CIV equivalent widths 
observed in the spectra of Lyman-break galaxies themselves. Inter- 
galactic CIV absorption may sometimes be produced by material 
ejected from Lyman-break galaxies, but not by the same material 
that produces the galaxies' own interstellar absorption lines. 

5. CORRELATION FUNCTIONS 

The previous sections were primarily concerned with 
spatial averages over the auto- and cross-correlation func- 
tions of galaxies, CIV systems, and Lyman-a forest flux 
decrements. These averages highlighted the aspects of the 
spatial association of galaxies, metals, and gas that we 
found most interesting. But readers may wonder how our 
conclusions would alter if we averaged the correlation func- 
tions over different volumes, or may wish to know how 
galaxies and intergalactic material are associated on spa- 
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Fig. 19. — Similar to Figure 18, except this plot compares Lyman- 
break galaxies' strong observed interstellar Ly-a and CIV absorption 
(curves shaded beneath) to the undetected absorption that fore- 
ground Lyman-break galaxies produce in the spectra of background 
Lyman- break galaxies at small impact parameters 20 < rg < 40h~ 1 
proper kpc (points). Seventeen galaxy pairs have impact parame- 
ters so small. The points' vertical error bars show the rms disper- 
sion in relative flux at each velocity offset among many sets of 17 
galaxy pairs drawn at random (with duplication allowed) from the 
17 galaxy pairs with 20 < rg < 40/i _1 proper kpc. 



tial scales other than those we considered. This section 
presents the correlation functions with no spatial averag- 
ing imposed. 

5.1. Two dimensional 

Figures 20 and 21 show the two dimensional correlation 
functions of galaxies with galaxies, of galaxies with de- 
tected CIV systems, and of galaxies with Lyman-a forest 
flux decrements. The data used to generate the two fig- 
ures were identical; figure 21 is less heavily smoothed to 
help bring out the behavior of the correlation functions on 
small spatial scales. The abscissae correspond to angular 
separations rg on the plane of the sky; the ordinates corre- 
spond to redshift separations r z . Observed separations in 
arc-seconds and in redshift were converted to the comoving 
distances implied for an £Im = 0.3, f^A = 0.7 cosmology. 
The value of the correlation function £ gc between galaxies 
and CIV systems at separation rg, r z was estimated with 
the statistic 

^ = (DgD C - DgR C ~ Rg D C + RgR C ) / RgR C (16) 

(e.g. Landy & Szalay 1993) where D g D c is the observed 



number of galaxy-CIV-system pairs with separation rg,r z , 
D g R c is the number of pairs with the same separation be- 
tween our galaxy catalog and a random catalog of CIV 
systems, and R g R c is the number of pairs with the same 
separation between a random galaxy catalog and random 
CIV catalog. The correlation functions of galaxies with 



Lyman-a flux decrements and £ g9 of galaxies with galax- 
ies were estimated similarly. The random galaxy catalogs 
were generated by assigning each galaxy in the true cat- 
alog 1000 redshifts drawn at random from our selection 
function; the angular positions of galaxies in the random 
and real catalogs were the same. The random CIV cata- 
logs were generated by assigning each CIV system in the 
true catalog 1000 redshifts drawn from a uniform distribu- 
tion between the minimum and maximum redshifts where 
our QSO spectra allowed us to detect CIV systems; the 
observed number density of detected CIV systems among 
our QSO spectra at each redshift is closely approximated 
by a constant once the different redshift selection ranges 
are taken into account. The random flux decrement spec- 
tra were linear functions with the form of equation 8, but 
scaled to match the observed mean transmissivity of each 
QSO spectrum. The galaxy-galaxy correlation function 
was calculated using only data in the redshift range 2.6 < 
z < 3.4, to minimize the effect on our result of the poorly 
determined wings of the selection function. The calcula- 
tion of the galaxy-flux cross-correlation function excluded 
data with z < 2.6, with z > 3.4, with z > zqso — 0.05, and 
with z < (1 + z QSO ) x 1026/1216 - 1. The final criterion 
excludes portions of the Lyman-a forest that are contami- 
nated by Lyman-/3 absorption from material at higher red- 
shifts. Also excluded were data at the DLA-contaminated 
redshifts 2.91 < z < 2.98 in SSA22 and 3.214 < z < 3.264 
in Q0933+2841. The galaxy-CIV cross-correlation func- 
tion calculation excluded data with z < 2.6, z > 3.4, 
z > z QSO ~ 0.05, or z < (1 + z QSO ) X 1216/1549 - 0.99. 
The last two criteria exclude CIV lines that lie close to the 
QSO redshift or in the Lyman-a forest. To help guide the 
eye, the resulting raw two-dimensional correlation func- 
tions were finally smoothed by two-dimensional Gaussians 
with og — a z = 0.5/i _1 (figure 20) or ag = cr z — 0.2/i _1 
(figure 21) Mpc comoving. 

Aside from some elongation in the redshift direction at 
small separations, which is at least partly due to errors in 
galaxy redshifts, these two-dimensional correlation func- 
tions are reasonably isotropic. This can be seen in fig- 
ure 22. The figure shows the average value of each corre- 
lation function £ in three-dimensional ellipses of comoving 
volume 47r(4ft," 1 Mpc) 3 /3 as a function of axial ratio r z /rg. 
The galaxy-galaxy correlation function is the most elon- 
gated in the redshift direction, reaching a maximum of 
£ at r z /rg ~ 1-1.6. This is presumably because galaxy 
redshift errors have twice the impact and because close 
galaxy pairs tend reside in dense parts of the universe 
where the pair- wise velocity dispersion is highest. But the 
two averaged cross-correlation functions both hit an ex- 
tremum at r z /rg — 1.0±0.2, demonstrating their isotropy 
on ~ 4/i _1 Mpc scales. If one were convinced that the 
cross-correlation functions should be isotropic despite pos- 
sible peculiar velocity distortions and despite the signifi- 
cant errors in our galaxy redshifts, figure 22 would show 
that the Alcock-Paczynski (1979) test at z ~ 3 favors a 
A cosmology; if we had assumed SIm = 1.0, £1a = or 
Qm = 0.3, Oa = instead of Am = 0.3, fl\ = 0.7, we 
would have found that the averages hit their extrema at 
r z /rg ~ 0.8 not r z /rg ~ 1.0. In any case, one unsur- 
prising implication of the lack of redshift elongation in 
the cross-correlation functions is that the bulk of inter- 
galactic absorbing material is not flowing outwards with 
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Fig. 20. — The two dimensional auto- and cross-correlation func- 
tions of galaxies with galaxies, with CIV systems, and with inter- 
galactic Lyman-a transmissivity. The galaxy-transmissivity corre- 
lation function is multiplied by —10. Each correlation function was 
smoothed by a two-dimensional Gaussian with a = 0.5h~ 1 comov- 
ing Mpc. 
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Fig. 21. — The correlation functions of figure 20 less heavily 
smoothed (<r = 0.2h _1 comoving Mpc). 



respect to nearby galaxies; the peculiar velocities of in- 
tergalactic material relative to galaxies is not sufficient 
to make the galaxy-intergalactic material cross-correlation 
functions be significantly more elongated than the galaxy- 
galaxy correlation function. There is no contradiction with 
the galactic superwind hypothesis because a wind's evolu- 



tion is dominated by its stalling phase and because in any 
case only a small fraction of the intergalactic volume would 
be affected by active winds at any time (e.g., Adelberger 
2003). 
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Fig. 22. — Average value £ of the auto- and cross-correlation 
functions of figures 20 and 21 within ellipses of volume 47r(4h _1 ) 3 /3 
comoving Mpc 3 as a function of the axial ratio r z /rg. £ reaches an 
extremum close to r z /rg = 1 for each correlation function, showing 
that they are reasonably isotropic on these scales. 

The scale dependence of correlation functions is often 
easiest to apprehend when they are presented as one- 
dimensional rather than two-dimensional functions. It 
would be trivial to average the correlation functions of 
figure 20 in circular annuli, but because of peculiar ve- 
locities and redshift measurement errors, a more robust 
estimate of the one-dimensional correlation functions will 
come from another approach. 

5.2. One dimensional 

Appendix C describes the approach adopted by Adel- 
berger (2000) to estimate the one-dimensional auto-correlation 
function of Lyman-break galaxies at z ~ 3. Following 
Davis & Peebles (1983), we first estimated the projected 
correlation function, which is a marginalization in the red- 
shift direction of the two dimensional correlation function 
(e.g., a marginalization of figure 20 or 21). We then esti- 
mated the shape of the one-dimensional correlation func- 
tion from a power-law fit to the projected correlation func- 
tion. See appendix C. An identical approach can be used 
to estimate the cross-correlation function of galaxies with 
CIV systems. Figures 23 and 24 show the auto-correlation 
function of the ~ 700 galaxies with the most secure red- 
shifts in the Lyman-break sample of Steidel et al. (2003; in 
preparation) and the cross-correlation function of galaxies 
and CIV systems in the sample of this paper. In both cases 
the data appear to be reasonably well fit by a power-law of 
the form £(r) = (r/r0)~ 7 , justifying the method we have 
adopted. The best-fit values and la uncertainties of 7 and 
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ro are shown in the figures. 
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Fig. 23. — The projected correlation function of Lyman-break 
galaxies. The parameters of the best power-law fit to the three- 
dimensional correlation function £(r) = (r/ro) 7 are shown. 
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Fig. 24. — The projected cross-correlation function of CIV sys- 
tems and Lyman-break galaxies. 

The galaxy-CIV cross-correlation function resembles the 
galaxy-galaxy correlation function to a large extent. The 
slopes are identical; the correlation lengths differ by only 
~ 20%. This suggests that galaxies and CIV systems may 
be similar objects, a point that has been made by Sargent, 
Steidel, & Boksenberg (1988), by Quashnock & Vanden 



Berk (1998), and by us (§ 4.3 above) on slightly different 
grounds. 

Unfortunately we were unable to estimate the cross- 
correlation function between galaxies and Lyman-a trans- 
missivity with a similar approach. The difficulty is that 
(after the first ~ Mpc) the HI content of the IGM ap- 
pears to decline quite slowly with increasing distance from 
a galaxy. This can be guessed from figure 11, but is 
seen most robustly in figure 25, which shows cuts along 
the redshift direction through the two-dimensional galaxy- 
transmissivity cross-correlation function. To improve the 
statistics, data for rg < 1.1ft -1 Mpc were estimated 
from the absorption that foreground LBGs produce in 
the spectra of background LBGs (see § 4.4 for more de- 
tails); at larger separations the data are the same as 
in figure 20. One must advance to a impact parameter 
rg £ 4ft -1 comoving Mpc before the net Lyman-a absorp- 
tion within velocity separations —600 < At; < 600kms~ 1 
decreases to half its value at the smallest impact param- 
eters rg lft -1 Mpc. This implies that the slope j s f of 
the galaxy/transmissivity correlation function lies precar- 
iously close to the values j s { < 1 that cause our statistical 
deprojection (equation C2) to break down entirely. Al- 
though the best formal fit of a power-law correlation func- 
tion to the data in Figure 25 (marginalized by ±6ft _1 co- 
moving Mpc, or ~ ±670kms x ) has r Q ~ 0.49ft" 1 comov- 
ing Mpc, 7 ~ 1.15, the data are almost equally compatible 
with 7 g f < 1, and that complicates our attempts to place 
confidence intervals on the parameters. In any case the 
best fit parameters depend uncomfortably on the distance 
over which we marginalize the data. The problem is that 
the measured mean transmissivity stays slightly lower than 
the global mean even at large distances from galaxies (cf. 
figure 11), and it is unclear whether this is part of the true 
correlation function or merely a ~ percent level systematic 
error due (e.g.) to our continuum fitting. Fitting the two- 
dimensional data directly with a power-law smoothed by a 
Gaussian in the redshift direction does not help. None of 
our fits filled us with complete confidence. We shall defer 
a proper estimate of the galaxy-transmissivity correlation 
function. 

The uncertainty in the shape of the galaxy-flux corre- 
lation function does not stop us from making one final 
observation: the ratio of CIV to HI in the intergalactic 
medium almost certainly increases as one approaches a 
Lyman-break galaxy. The mean Lyman-a transmissivity 
of the intergalactic medium does not change much with 
distance from a Lyman-break galaxy, only by £ 30% (cf. 
figure 11), so it may be a reasonable first approximation 
to assume that the density of intergalactic HI is roughly 
independent of distance to a Lyman-break galaxy. But 
the number density of CIV systems depends very strongly 
on distance to a Lyman-break galaxy; according to the 
power-law fit described above, the density of CIV systems 
is ~ 6.4 times higher at r — 1ft -1 comoving Mpc than at 
r = 10ft -1 Mpc. Readers may wish to refer to Figure 26, 
which shows marginalizations by ±6ft _1 comoving Mpc in 
the redshift direction over the two-dimensional correlation 
functions of Figure 20. The marginalization reduces the 
differences between the correlation functions somewhat, 
but nevertheless they are clear. The significantly different 
strengths of the galaxy-CIV and galaxy-transmissivity cor- 
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Fig. 25. — Cross-cuts through the galaxy-transmissivity cross- 
correlation function (plus one) at different angular separations (as 
marked, in h~ x comoving Mpc). The line for 4 < rg < 5.5h~ 1 Mpc 
(for example) shows how the typical Lyman-a transmissivity of the 
intergalactic medium varies along a line that passes a Lyman-break 
galaxy at an impact parameter of ~ 4—5 Mpc. The transmissivity 
is near the global mean (/} at large redshift separations from the 
galaxy, begins to fall within ~ 1000kms~ 1 of the galaxy, and reaches 
a minimum near the galaxy's redshift. Data for r < 0.65/i Mpc 
were calculated from the absorption that foreground LBGs produce 
in the spectra of background LBGs; the galaxy proximity effect 
should not be detected in this data because the resolution of our 
galaxy spectra (~ 500kms -1 FWHM, or ~ 4.5/i -1 comoving Mpc) 
significantly exceeds the ,$ lh~ 1 comoving Mpc diameter of the re- 
gion around each galaxy that appears to have a lowered HI content. 
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Fig. 26. — Marginalizations over < r z < 6h~ 1 comoving Mpc 
of the data shown in figure 20. Points with error bars show the 
marginalized galaxy-galaxy, galaxy-CIV system, and galaxy/Ly-a 
transmissivity correlation functions plus one. The plot shows (for 
example) that a cylinder of depth r z = ±6/i — 1 Mpc and radius 
rg = 0.65/i — 1 Mpc centered on a typical LBG will contain ~ 9 
times the number of (other) LBGs and ~ 3.5 times the number of 
CIV systems as a similar cylinder randomly placed. Other bins of 
rg on the plot correspond to volumes that are cylindrical annuli 
centered on LBGs rather than cylinders. To improve the statistics, 
the galaxy-HI measurement at the smallest separation was estimated 
from the absorption that foreground LBGs produce in the spectra 
of background LBGs; see § 4.4. Striations in the background show 
how this plot would appear if the unmarginalized density had the 
form p(r) oc 1 + (r/ro)~ 1 ' 6 . 



relation functions make it seem almost inevitable that the 
ratio of intergalactic CIV to HI density increases dramat- 
ically close to Lyman-break galaxies — though we should 
emphasize again that neither the mean Lyman-a trans- 
missivity nor the number density of CIV systems is related 
in a simple way to column density. One may demonstrate 
through simple calculations, omitted at the referee's re- 
quest, that increasing the intensity of the radiation field 
at z ~ 3 would lead to a decrease in the CIV content 
of most of the Lyman-a forest. This shows that radia- 
tion from Lyman-break galaxies is unlikely to account for 
any enhancement in CIV density near to them. Figure 14 
shows that plausible changes in the intergalactic density 
and temperature near the galaxies would be unlikely to 
change nciv /"■hi by a factor of ~ 6. We conclude that the 
simplest explanation for the (apparently) strong change of 
new I ^hi with radius may be that the intergalactic metal- 
licity is higher near the galaxies. See Steidel et al. (2003; 
in preparation) for a less superficial analysis. 

6. SUMMARY 

The goal of this paper was to compare the spatial dis- 
tributions of galaxies, metals, and neutral hydrogen at 
high redshift. We conducted a redshift survey of z ~ 3 



Lyman-break galaxies in 6 primary fields containing QSOs 
at 3.1 ^ z ^ 3.6 whose spectra revealed the locations of 
HI and metals along sightlines through the galaxy distri- 
butions. We quantified the relationship between galaxies 
and intergalactic material by calculating two-point corre- 
lation functions or closely related statistics. Various aver- 
ages over the correlation functions were presented in §§ 3 
and 4; the one-dimensional and two-dimensional correla- 
tion functions of galaxies with galaxies, with CIV systems, 
and with Lyman-a forest transmissivity were presented in 
§ 5. The most lasting contribution of this paper may be 
its methodology, which will be applied to a larger sample 
in the future. Applying these statistical methods to the 
present sample leads to the following principal conclusions: 

1 . The HI content of the intergalactic medium is closely 
correlated with the positions of nearby galaxies. The ma- 
terial within r ~ 0.5/i _1 comoving Mpc of Lyman-break 
galaxies absorbs Lyman-a photons less heavily than the 
material at random points in the intergalactic medium 
(§§ 4.1, 4, and 5); the material at slightly larger radii 
1 ^$ r !S 5/i _1 Mpc absorbs Lyman-a photons more heav- 
ily (§§ 3.1 and 5) than average. Interested readers will 
find in Paper II a more detailed discussion of the correla- 
tion between galaxies and intergalactic HI on large spatial 
scales. 
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2. The gas within the largest galaxy overdensities at 
z ~ 3, presumably the young intraclustcr medium, is rich 
in HI and CIV (§§ 3.1, 3.2, and 5). 

3. The cross-correlation function of galaxies with CIV 
systems is similar to the auto-correlation function of galax- 
ies with galaxies. Both appear to be power-laws; the 
best fit parameters to a correlation function of the form 
£(r) = (r/ro)^ are r$ — 3.2±0.3, 4.0±0.3 comoving hr 1 
Mpc and 7 = 1.60 ± 0.20, 1.55 ± 0.15 for the galaxy-CIV 
and galaxy-galaxy correlation functions respectively (ler; 

= 0.3, fl\ = 0.7). This shows that CIV systems and 
Lyman-break galaxies are found in similar parts of the uni- 
verse and suggests that they may be similar objects (§ 5). 

4. The ratio of the number density of CIV systems to HI 
density (§ 5) and to HII density (§ 3.2) almost certainly in- 
creases within a few comoving Mpc of Lyman-break galax- 
ies. Most detectable CIV systems (N C w ~ 10 115 cm" 2 ) 
lie within 2Ah~ 1 comoving Mpc of a Lyman-break galaxy 
(§ 4.3). Stronger CIV systems tend to lie even closer. The 
strongest CIV systems (A^civ ~ 10 13,5 cm -2 ) appear to be 
so strongly correlated with nearby (r 0.5ft. -1 comoving 
Mpc) Lyman-break galaxies that they should be consid- 
ered the same object — despite the large distance between 
the galaxy and CIV-absorbing gas (§ 4.3). 

5. Damped Lyman-a systems appear to reside in dif- 
ferent environments than Lyman-break galaxies (§ 3.4). 
Though the sample is small and the statistics are poor, 
the available data suggest that they should not be consid- 
ered similar objects (cf. Fynbo et al. 1999, Haehnelt et 
al. 2000, Gawiser et al. 2001). 

6. One of our surveyed volumes surrounds the line 
of sight to Q0302-0019, a QSO whose spectrum reveals 
patchy Hell absorption. Comparing the Hell spectrum 
of this QSO with our galaxy and CIV data suggests that 
Hell tends to be absent in regions of higher galaxy density 
and in regions with a higher incidence of CIV absorption 
(§3.3). Presumably it has been ionized to HcIII. 

Readers may find in these results some indications that 
we have observed powerful supcrwinds emerging from 
Lyman-break galaxies. In particular, the lack of HI within 
<~ 0.5/i _1 Mpc of the galaxies and excess of CIV at slightly 
larger radii could be neatly explained if the ~ 600 km s _1 
outflows common to Lyman-break galaxies (§ 2; Pettini 
et al. 2001, 2002) are able to escape the galaxies' po- 
tentials and maintain their large velocities throughout the 
galaxies' ~ 300Myr (Shapley et al. 2001) star-formation 
timcscale. This idea will be considered more fully in sub- 
sequent papers of our series. The present paper is merely 
a first report. 
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APPENDIX A 

A LIMIT ON THE ASSOCIATION OF TWO RANDOM FIELDS 

Consider two standard ways of picturing the relationship between galaxies and intergalactic metals. The metals might 
lie only within some radius of the galaxies — for example, within the galaxies' extended gravitational potentials or within 
the largest radius reached by the galaxies' winds. Or they might be truly intergalactic but still statistically correlated 
with galaxies because galaxies are found in parts of the universe where the density of other sorts of matter is high. One 
can safely assert that the truth lies between these two extremes. But one need not stop with this meek conclusion. Our 
goal is to illustrate that it is possible, at least in principle, to tell which of the two pictures lies closer to the truth. Various 
approaches could be taken; ours is to derive a statistical inequality that would be satisfied by the cross-correlation function 
of galaxies and CIV systems in the second picture but not necessarily the first. The inequality is violated by our data, 
suggesting a direct connection between intergalactic metals and nearby galaxies in at least some cases. 

We begin by deriving the inequality. Let /(r), g(r), /fe(k), and <?fc(k) be two arbitrary random fields and their Fourier 
transforms, /(r) and g(r) might represent the density of CIV systems and of galaxies at spatial position r. The Cauchy- 
Schwarz inequality in the form 

\(a*b)\ 2 < (a*a)(b*b) (Al) 

leads to the following two inequalities: 

4W < £//(0)£ 9S (0), (A2) 
where £// is the auto-correlation function of / and £j 9 is the cross-correlation function of / with g, and 

|P/ 9 (k)| 2 < P ff (k)P gg {V), (A3) 

where P// is the power-spectrum of / and Pf g is the cross-spectrum of / and g. Equations A2 and A3 result from inserting 
a = f(r'), b = g(r — r') and a = .ffe(k), b = .gfc(k) into equation Al. 

Equation A2 does not appear to be particularly useful, because the RHS is independent of r and depends on the value 
of the correlation function at zero lag, which is often difficult to estimate. Fortunately the possible association of / and 
g in real space can be limited by a large number of other inequalities. Consider, for example, the mean value of the 
correlation function £/ s weighted by an arbitrary function W(r), 

ify = J d\^ g {v)W{v)/ J d 3 rW(v). (A4) 

If W were equal to a constant for |r| < r sp h and to zero elsewhere, £/ g would be the mean value of the correlation function 
£f g within a sphere of radius r sp h- After writing £/ s as the Fourier transform of the cross-spectrum and applying Fubini's 
theorem, equation A4 becomes 

£ A « fd 3 kP fg (k)W^(k) (A5) 



or 



DC 



d 3 k(f k (k)gi(k))WZ(k) 



(A6) 



where IVfc(k) is the Fourier transform of W(r). If IVfc(k) is real and positive, TVfc(k) 1 / 2 can be absorbed into each of fk 
and gk in equation A6, leading through equation Al to the inequality 

el < £//£ 99 (a?) 

which holds only if the correlation functions are averaged in a suitably chosen way. Because the Fourier transform of a 
Gaussian centered on the origin is real and positive, correlation functions averaged within a Gaussian volume must obey 
inequality A7. 

We have assumed so far that / and g are smoothly varying functions. In fact galaxies and CIV systems are (nearly) 
discrete objects; if we divided all space into small cells, we would find an occasional cell that contained one galaxy and/or 
one CIV system but most cells would be empty. How does this affect inequality A7? It depends. Imagine underlying and 
unobserved continuous functions /' and g' defined such that the mean number of observed particles in places where the 
continuous functions take on a given value is equal to that value: E(f\f) = /', E{g\g') = g' , with E{x\y) the expectation 
value of x given y. The continuous functions will clearly satisfy inequality A7. Peebles (1980, §33) has shown that the 
auto-correlation functions of / and g are identical to the auto-correlation functions of /' and g' . Their cross-correlation 
function depends on (fg), which may be written 

(fg) = J df'dg' P(f'g')E(fg\f'g') (A8) 
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where P(f'g') is the probability that the functions /', g' simultaneously take on the values /', g' and E(fg\f'g') is the 
expectation value of the product fg given the values of /' and of g 1 . If P(fg\f'g') = P(f\f')P(g\g'), then E(fg\f'g') = 
E(f\f')E(g\g') — f'g' an d substitution into equation A8 shows that (fg) = (fg 1 )- This implies that the cross-correlation 
function of / and g will be equal to the cross-correlation function of /' and g'. Since the auto-correlation functions are 
equal as well, we conclude that particle distributions / and g related via this Poisson process to the continuous functions 
/' and g' will obey inequality A7 provided P{fg\f'g') = P(f\f')P(g\g'). It is worth giving some thought to the meaning 
of this condition. We introduced the continuous functions /' and g' as a mathematical aid to calculating the particle 
distributions' auto- and cross-correlation functions, but suppose momentarily that the particle distributions / and g were 
in fact generated from the continuous functions /' and g' through some stochastic process to be specified, for example by 
associating an independently and identically distributed random number with each volume element dV and then placing 
a particle in / in every volume element where the random number was less than f'(r)dV. If P(fg\f'g') = P(f\f')P(g\g'), 
then the job of constructing the particle realizations / and g of the continuous functions /' and g' could be completed 
in the two independent steps of generating / from /' and g from g'. This condition should be appropriate if metal- line 
absorption systems were truly intergalactic, associated with galaxies only to the extent that both traced the same large- 
scale structure, because then one might imagine that the locations of galaxies and of metal-line systems were determined 
primarily by the large scale distribution of matter without much regard to each other. 

It would not be appropriate if there were a stronger correspondence of galaxies to metal-line absorbers. Suppose we 
were asked to design the intergalactic medium for a universe where most metal-line absorption was produced by galaxies' 
supcrwinds, for example. We clearly could not begin until we were told precisely where amid the jumbled large-scale 
structure the galaxies happened to lie. A counterpart of equation A7 for the general particle case with P(fg\f'g') =/= 
P(f\f')P(g\g') can be derived as follows. Our original derivation relied on the assumption that the correlation function 
£(r) and power-spectrum P(k) would be Fourier transforms of each other. When a random field is composed of discrete 
objects, however, the definition of the correlation function is customarily modified to exclude the correlation of an object 
with itself. In this case the power-spectrum is equal to the Fourier transform of the correlation function plus an additive 
constant (see, e.g., Peebles 1980 §41), and equation A7 accordingly becomes 



where n/ is the global number density of particles in / and hf = nj f d 3 r W(r)/W(0), rather than A7. Equation A9 is 
the statistical inequality that must be obeyed by the cross-correlation function of any two arbitrary particle distributions. 
Since n/ > and n g > for the weighting functions W(r) that we are considering, it is less restrictive than equation A7. 
This reflects the obvious fact that if the positions of particles in / and g are allowed to influence each other, we may 
obtain a stronger cross-correlation than otherwise. 

The difference between these two inequalities provides one way of testing whether an observed association of galaxies with 
CIV absorbers requires the observed galaxies to have influenced the intergalactic CIV content 8 in some way: calculate 
the mean value of the cross-correlation function within a Gaussian volume and see if it violates equation A7. This 
was the approach of § 4.3. Perhaps an example more mundane than galaxies and metals can help clarify the difference 
between particle distributions whose cross-correlation satisfies A7 and those whose cross-correlation satisfies A9. Although 
pedestrians, eyeglasses, and fire hydrants all tend to be found in similar places on the earth's surface, the cross-correlation 
function of pedestrians and eyeglasses obeys A9 while (except in very unfortunate circumstances) the cross-correlation 
function of pedestrians and fire hydrants obeys A7. Not all pedestrians wear eyeglasses, and not all eyeglasses are on 
pedestrians, but the locations of at least some eyeglasses are determined by where pedestrians happen to be standing. 
The locations of fire hydrants are not. 

We conclude with two obvious caveats. First, even if the global means satisfied the relationship E(fg\f'g') = 
E(f\f')E(g\g'), the observed means for any small sample might not. Consequently statistical fluctuations will cause 
violations of inequality A7 among small sub-samples of a population that obeys the inequality as a whole. Reasonably 
large samples, far larger than the one in this paper, will be required to make this sort of test convincing. Second, even if 
it were shown that the cross-correlation function of galaxies and metal-line systems does not violate A7, this would not 
justify the conclusion that the observed galaxies and observed metals exerted no influence on each other. It would show 
only that the influence was not dominant on the chosen spatial smoothing scale. 



Suppose we would like to estimate the universal average overdensity of Lyman-break galaxies in cells centered on CIV 
systems from a measurement of {Ni}, the number of Lyman-break galaxies in cells centered on each of the Af c CIV systems 
in our sample. What estimator should we use? If an average of /i galaxies were expected to lie in each CIV system's cell 
in the absence of a correlation between galaxies and CIV systems, a natural choice for the estimator would be 



? gf < &f + l/n/)(£ M + l/n fl ) 



(A9) 
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Or, equally likely a priori, for the intergalactic metal content to have influenced galaxy formation 



30 



Galaxies and Intergalactic Matter at z <~ 3 



the mean of the observed galaxy overdensity (Ni — fij/fi around each of the i CIV systems. But in practice we would 
not expect to observe fi Lyman-break galaxies around each CIV system if galaxies and CIV systems were independently 
distributed; because the selection function of our galaxy survey peaks around z ~ 3.0, more Lyman-break galaxies should 
lie close to a CIV system at z = 3.0 than a CIV system at (say) z = 2.2 or z = 3.5. The expected number of galaxy 
neighbors is not a constant but instead depends on each CIV system's redshift, and we must estimate the global mean 
overdensity not from {Ni} and /i but from {N} and {^i} where Hi is the expected overdensity around the ith CIV system 
given the system's redshift and the shape of our selection function. What estimator should we use now? 

Let £ gc be the (unknown) global mean overdensity of galaxies in cells centered on CIV systems, and let n>i = /Xj(l+£ gc ) be 
the expected number of galaxies in a cell centered on a CIV system that lies at redshift Zi. Approximating the probability 
of observing Ni galaxies when rij were expected with the Poisson distribution, P(Ni\n,i) = e~ ni n^ i /N^., and assuming 
that different cells are independent, one finds that the probability of observing the set of actual {Ni} and expected {nj} 
galaxy numbers in cells surrounding CIV systems is equal to the product of the individual probabilities: 

P({iV i }|{n i })=exp -(l+£ ffC )I> (l+4c) ' Yltf'/M. (B2) 

The £ gc dependence of equation B2 is identical to the £ gc dependence of a single Poisson distribution of the form P(x\x) — 
e~ x x x jx\ with x = YJ i Ni and x = (1 + £ gc ) J2i Mi- Since the Poisson likelihood P(x\x) is maximized for x = x, we can 
see immediately that we should adopt 



e sc = - 1 (B3) 



as the maximum likelihood estimator of £ gc . In practice the assumptions leading to equation B3 are somewhat incorrect — 
different cells are not completely independent, for example — and so the estimator is not optimal; but it provides an 
attractively simple and reasonably good way to correct for the complexities of our selection bias. 

In the notation of equation 16, estimator B3 would be written D c D g / D c R g — 1. The point of this section is not to 
advocate this particular estimator — in our more careful calculations the superior estimator of equation 16 (Landy & Szalay 
1993) was adopted instead — but to justify our practice of estimating £ by summing the number of observed and expected 
pairs in our full sample and then dividing, rather than (e.g.) by averaging together the values of N/ni — 1 observed 
around individual CIV systems or in individual fields. 



APPENDIX C 

SPATIAL CLUSTERING OF LYMAN-BREAK GALAXIES 

Because of peculiar velocities and redshift uncertainties, our estimate of any Lyman-break galaxy's position along the 
line-of-sight is imprecise. This imprecision complicates our attempts to measure the correlation function of galaxies at 
small separations. With a redshift uncertainty of a z ~ 0.0025 (<~ 1.8 Mpc) we cannot estimate the strength of the 
correlation function at separations r < 1 Mpc (say) by counting the number of galaxies that lie within 1 Mpc of each 
other and comparing to the expected number if galaxies were uniformly distributed; the number of galaxies within 1 Mpc 
of each other is something we do not know. 

One way around this problem (e.g., Davis & Peebles 1983) is to count not the number of galaxies whose estimated 
rcdshifts place them within a distance r of each other, but instead the number of galaxies with angular separation rg ± drg 
and redshift separation \Az\ < r z . If r z is significantly larger than each galaxy's positional uncertainty AZ, the resulting 
function of rg, n(rg, < r z ), will be unaffected by the size of our redshift errors. n(rg, < r z ) can then be inverted to produce 
an estimate of the correlation function £(r) that is not corrupted by redshift uncertainties. 

Different schemes can be used to estimate £(r) from n(rg, < r z ) (see, e.g., Davis & Peebles 1983). Here is our approach. 
It is designed for the low signal-to-noise regime where one can only hope to recover the gross features of the correlation 
function. The expected number of galaxy neighbors with angular separation rg and radial separation within ±r z is 

(n(rg, < r z )) = n(l + ±- £ dl £(^/z 2 + rj)^j (CI) 

where n is the number of galaxies we would expect to observe along a similar line randomly placed. If the correlation 
function is a power law, £(r) = (r/ro)~ 7 , dull algebra shows that the expected excess number of pairs is 

c p (r e , < r z ) ee M - l = r jAl B (l/2, ( 7 - l)/2)/ x (l/2, ( 7 - (C2) 

where B and I x are the beta function and incomplete beta function in the convention of Press et al. (1992, §6.4) and 
x ee r1{r 2 z + rg) -1 . We estimate the correlation function parameters ro, 7 by fitting equation C2 to observed number of 
galaxies with similar rcdshifts that lie at angular separation rg. 
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In principle any value of r z can be used, provided r z 3> cr z , but in practice some choices of r z are better than others. 
If r z is too small we will miss some correlated pairs. The incomplete beta function in equation C2 will correct for this 
only imperfectly. If r z is too large our sample will include a needlessly large number of uncorrelated pairs, and statistical 
fluctuations in the number of uncorrelated pairs may obscure the number of correlated pairs. Our choice of r z was designed 
to fall between these two extremes. We took r z to be the greater of 1000kms _1 (l + z)/i/(z) and 7rg. lOOOkms -1 is several 
times larger than our redshift uncertainty, and so the first lower limit helps ensure that redshift errors will not make us 
fail to recognize correlated pairs. The second limit ensures that we will integrate far enough down the correlation function 
to include at least 80% of correlated pairs for 7 £ 1.6. The derived correlation function does not change significantly if 
the lOOOkms -1 is changed by a factor of 2 in either direction or if we adopt (say) Ibrg rather than 7rg as the lower limit 
on r z . 

Figure 23 shows the observed excess galaxy counts as a function of angular separation rg. Overlaid are the expected 
excess galaxy counts at each separation for the best power-law fit to £(r). The data are reasonably consistent with a 
power-law correlation function. ut p was estimated with the Landy-Szalay (1993) estimator (DD — 2DR + RR)/RR; this 
nomenclature and method of generating random catalogs are described in the main body of the text below equation 16. 
The only change is that here DD (e.g.) is the observed number of galaxy pairs with angular separation rg and redshift 
separation \Az\ < r z , rather than the number of pairs with separation rg, r z . As described above, near equation 16, 
the angular locations of objects in the random and real catalogs were the same. This eliminated any artificial clustering 
signal due to angular variations in the fraction of Lyman-break galaxies with measured redshifts, an inevitable result of 
our sparse multislit spectroscopy, but also eliminated any contribution from the true angular clustering of Lyman-break 
galaxies to our estimate of £(r). As the angular correlation uj(8) of Lyman-break galaxies is almost undctectably weak 
(Giavalisco & Dickinson 2001), however, the resulting bias in £(r) should be small. 

The estimates r — 3.96 ± 0.29/i _1 comoving Mpc, 7 = —1.55 ± 0.15 follow from fitting equation C2 to the data 
by minimizing \ 2 with Powell's direction set method (Press et al. 1992; §10.5). The uncertainties were calculated by 
generating a large number of fake realizations of u p by adding to our estimated u> p at each rg a Gaussian deviate with 
standard deviation equal to the observed uncertainty, then fitting these fake realizations of u p to equation C2 with Powell's 
method. 68.3% of the fake realizations had best fit parameter values in the range listed above. 

We should emphasize that figure 23 provides little support for or against Porciani & Giavalisco's (2002) claim that the 
correlation function of Lyman-break galaxies becomes negative at very small separations. Any small-scale anticorrclation 
would be washed out in the redshift direction by our redshift uncertainties and by the integration of equation CI; any 
anti-correlation in the angular positions of galaxies would be missed in our analysis because of the way we generated our 
random catalogs. 

In any case, the correlation function estimated here agrees well with the clustering strength estimated from a counts- 
in-cells analysis. Using the statistic S of Adelberger et al. (1998) on our current full sample of Lyman-break galaxies, we 
estimate the relative variance of galaxy number density in cubical cells of side-length 11.6/i _1 Mpc (Qm = 0.3, £1a = 0.7) 
to be C7g al = 0.8 ± 0.2, which corresponds roughly to a correlation length of r = 4.65 ± 0.75/i _1 comoving Mpc for 
7 = -1.55 (see Adelberger et al. 1998). 
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